垃圾UA(userAgent)信息
屏蔽垃圾ua,可以大大节省服务器带宽成本,以下是懒人工具管理员多年的总结和收集的毫无意义的垃圾UA爬虫机器人Bot,拦截掉可以让正规的搜索引擎更快速的抓取你的网站内容.
headless | bai.com | Crawler | Barkrowler | CakePHP |
---|---|---|---|---|
GarlikCrawler | Go-http-client | ias_crawler | ICC-Crawler | PotPlayer |
Riddler | Scrapy | WINAMP | viz/viz | ZXing |
Castro | Jakarta Commons | ltx71 | NativeHost | SalesIntelligent |
Xenu Link Sleuth | Y!J-ASR | BUbiNG | CRAZYWEBCRAWLER | http Cnrdn |
Lavf | NSPlayer | spray-can | stagefright | voltron |
LibVLC | A6-Indexer | crawler4j | wsr-agent | DigitalPebble Crawler |
MBCrawler | AhrefsBot | GrapeshotCrawler | proximic | SemrushBot |
ahoy! | alkaline | ananzi | anthill | arachnophilia |
arale | araneo | aretha | ariadne | arks |
askjeeves | atn worldwide | auresys | backrub | big brother |
bjaaland | blackwidow | bloodhound | calif | cassandra |
christcrawler.com | churl | cienciaficcion.net | cmc/0.01 | collective |
combine system | computingsite robi/1.0 | crawler.feedback | cusco | cyberspyder link test |
katalog/index | die blinde kuh | digger | direct hit grabber | download express |
dwcp | ebiness | e-collector | emacs-w3 search engine | esculapio |
esther | evliya celebi | fastcrawler | felix ide | fetchrover |
fido | fish search | fouineur | freecrawl | funnelweb |
gazz | gcreep | getterroboplus puu | geturl | golem |
grapnel/0.01 experiment | griffon | gromit | Gluten | h?m?h?kki |
harvest | havindex | hi (html index) search | hku www octopus | ht://dig |
html_analyzer | htmlgobble | hyper-decontextualizer | ia_archiver | ibm_planetwide |
image.kapsi.net | imagelock | incywincy | informant | infoseek sidewinder |
ingrid | inktomi slurp | inspector web | intelliagent | internet shinchakubin |
iron33 | israeli-search | javabee | jcrawler | jumpstation |
katipo | kdd-explorer | kilroy | kit-fireball | labelgrabber |
larbin | legs | link validator | linkscan | linkwalker |
lockon | logo.gif crawler | lycos | mac wwwworm | magpie |
marvin/infoseek | mattie | mediafox | merzscope | mindcrawler |
mnogosearch search engine software | moget | monster | motor | muncher |
muninn | muscat ferret | mwd.search | nec-meshexplorer | nederland.zoek |
netcarta webmap engine | netmechanic | netscoop | newscan-online | nhse web forager |
nomad | northern light gulliver | nzexplorer | objectssearch | occam |
OOZBOT | openfind data gatherer | orb search | pack rat | pageboy |
parasite | patric | pegasus | perlcrawler 1.0 | pgp key agent |
phpdig | piltdownman | pioneer | plumtreewebaccessor | poppi |
popular iconoclast | raven search | roadhouse crawling system | robofox | robozilla |
rules | scooter | search.aus-au.com | searchprocess | senrigan |
sg-scout | shagseeker | sift | site searcher | site valet |
sitetech-rover | skymob.com | slcrawler | sleek | snooper |
suke | suntek search engine | sven | sygol | tach black widow |
tarantula | templeton | the peregrinator | the web moose | the web wombat |
the world wide web wanderer | the world wide web worm | titan | titin | ucsd crawl |
udmsearch | unnamed | url check | valkyrie | verticrawl |
victoria | vision-search | voyager | w3m2 | w3mir |
walhello appie | wallpaper (alias crawlpaper) | web core / roots | webcatcher | webcopy |
webfetcher | webinator | weblayers | weblinker | weblog monitor |
webmirror | webquest | webreaper | websnarf | webstolperer |
webvac | webwalk | webwalker | webwatch | webzinger |
wget | whatuseek winona | wild ferret web hopper | wired digital | wwwc ver |
xget | daumoa | jobo | echo! | linkchecker |
bloglines | twiceler | appie | sun4u | httrack |
sisi | robi | webster pro | webster | zeus |
scirus | picosearch | plucker | disco pump | gulliver |
emailsiphon | teleport pro | fetch | pamuk | webcopier |
webcapture | mass downloader | awv0.8d | crescent internet toolpak | webstripper |
sitesucker | webdup | python-urllib | python | franklin locator |
ck-sillydog | pockethttp | java | kototoi.org | teragramwebcrawler |
vagabondo | nogoop-httpclient | myoperatb | myoperatb | accoona-ai-agent |
arachmo | b-l-i-t-z-b-o-t | boitho.com-dc | cerberian drtrs | charlotte |
converacrawler | cosmos | covario ids | dataparksearch | earthcom.info |
fast enterprise crawler | fast-webcrawler | findlinks | g2crawler | holmes |
htdig | iccrawler | ichiro | igdespyder | issuecrawler |
l.webis | lwp-trivial | mabontland | magpie-crawler | mnogosearch |
mogimogi | morning paper | mvaclient | netresearchserver | netseer crawler |
newsgator | ng-search | nutchcvs | nymesis | oegp |
orbiter | peew | pompos | postpost | pycurl |
qseero | radian6 | sandcrawler | sbider | scoutjet |
scrubby | searchsight | seekbot | semanticdiscovery | sensis web crawler |
shim-crawler | shopwiki | snappy | sqworm | stackrambler |
teoma | tineye | truwogps | updated | vortex |
vyu2 | webcollage | websquash.com | wf84 | womlpefactory |
yacy | yahooseeker | yahooseeker-testing | yandeximages | yandexmetrika |
yeti | yooglifetchagent | zyborg | wordpress | a6-indexer |
wsr-agent | Microsoft Office | JDatabaseDriver | facebookexternalhit | The+Knowledge+AI |
Twitterbot | VenusCrawler | aria2 | GetCode | CCBot |
NetTrack | Turnitin | IAS crawler | POE-Component | VelenPublicWebCrawler |
www.ru | Nutch Master Test | Wotbox | orion-semantics.com | lwp-request |
ShortLinkTranslate | mj12bot | WinHttpRequest | Exabot | Auto Spider |
DuckDuckGo | SeznamBot | moatbot | DotBot | SurdotlyBot |
28logsSpider | zgrab | Windows-Media-Player | spbot | Mail.RU_Bot |
Backlink | SiteExplorer | SEOkicks | linkdexbot | Qwantify |
DataXu | ExtLinksBot | gvfs/ | evc-batch | Cliqzbot |
YandexBot | YandexMobileBot | newspaper | Clickagy | Chicken laser |
coccocbot | Microsoft Windows Network Diagnostics | spuhex.com | smtbot | Dataprovider |
HybridBot | Sky-Wapproxy | SafeDNSBot | HatenaBookmark | Meta_Bot |
ToutiaoSpider | HttpComponents | ips-agent | yandex.com/bots | (ziva) |
Jersey | Auto Shell Spider | User-Agent | curl | MPlayer |
internal request | Grammarly | package | TrendsmapResolver | PaperLiBot |
startmebot | WebFuck | GStreamer | httpsrc | AntennaPod |
panscient.com | webscan | Screaming Frog | WFilter Live | trendictionbot |
nsrbot | PlurkBot | Mojolicious | AlphaBot | tracemyfile |
VCTestClient | heritrix | MiniRedir | Iframely | rest-client |
Cappuccino | FirmsBot | BOT for JCE | Nimbostratus-Bot | Emacs-w3m |
WordupinfoSearch | Dispatch | Paracrawl | Mr.4×3 | axios |
Typhoeus | tools.random | WhatCMSBot | InetURL | NetpeakCheckerBot |
Goose | lua-resty | WhatWeb | special_archiver | XoviBot |
Wappalyzer | OK-Search-Bot | abot | Mechanize | uipbot |
GnowitNewsbot | PostmanRuntime | HoneyBee | gobuster | Bidtellect |
Sonos | RankingBot | Uptimebot | Synapse | Re-re Studio |
Mappy | Statastico | Linguee Bot | PocketImageCache | colly |
YunSecurityBot | archive.org_bot | CheckMarkNetwork | Jooblebot | ZoomBot |
Linkbot | Streamline3Bot | LetsearchBot | Linguee-Bot | Thither.Direct |
Bose/ | PPBot | IndeedBot | Everyonedomainsbot | PPBot |
MixnodeCache | NetpeakSpiderBot | TagVisit | RestSharp | Symfony |
Needle | kubectl | vuhuvBot | Staddlebot | ddline.cn |
AdsrvrContextual | _zbot | PagePeeker | OutclicksBot | Kozmosbot |
PicoFeed | Mediatoolkitbot | netdisk | ESP32 | Traackr.com |
Discordbot | PinkBot | Validator | Semantic | aiHitBot |
Zoxh.Com | foobar2000 | bitlybot | beegoServer | MFC_Tear_Sample |
Quantcastbot | HeiKe | ManicTime | News | Windows 95 |
Windows 98 | WebPictures | SBL-BOT | DreamPassport | Blazer |
RealMedia | Liberate DTV | Cyberdog | Fuzz Faster | portalmmm |
WannaBe | bluefish | Utopia WebWasher | Offline Explorer | Visicom |
Barca | ANTFresco | Hotzonu | Wfuzz | Dillo |
iSiloX | Commerce Browser Center | W3CLineMode | Pandalytics | LinkpadBot |
daum.net | NewTV | GigablastOpenSource | MAZBot | pilicanbot |
EchoboxBot | Cincraw | ScraperBot | admantx | AspiegelBot |
BDCbot | LogStatistic | MAZBot | CheckHost | 7Siters |
BorneoBot | Cincraw | HuaweiWebCatBot | PetalBot | ZoominfoBot |
Pinterestbot | MojeekBot | SeoBot | LogStatistic | l9explore |
FMODStudio | Nutch Spider | DomainStats | seostar | omgili |
webprosbot | ThinkChaos | WellKnown | Punkspider | DataForSeo |
Keybot | Baispider | MegaIndex.ru | MauiBot | BLEXBot |
digext | MagiBot | Adsbot | Nmap | vip0.ru |
thetradedesk | Apache-HttpClient | trendkite-akashic-crawler | phantomjs | Amazonbot |
semantic-visions.com | SkypeUriPreview | serpstatbot | PubMatic | \xB2\xBB\xCA\xCA\xD3\xC3UA |
libcurl-agent | Neevabot | Seekport | Linespider | msray-plus |
fidget-spinner-bot | AwarioBot | ImagesiftBot |