biqugetxt.cc
robots.txt

Robots Exclusion Standard data for biqugetxt.cc

Resource Scan

Scan Details

Site Domain biqugetxt.cc
Base Domain biqugetxt.cc
Scan Status Ok
Last Scan2024-09-29T04:49:30+00:00
Next Scan 2024-10-29T04:49:30+00:00

Last Scan

Scanned2024-09-29T04:49:30+00:00
URL https://biqugetxt.cc/robots.txt
Domain IPs 38.238.196.168
Response IP 38.238.196.168
Found Yes
Hash 0c85317ff544b1f7226e293504b54dff277811f7bfba197987102f48867baa60
SimHash 605555834795

Groups

baiduspider

Rule Path
Disallow

baiduspider-image

Rule Path
Disallow

baiduspider-render

Rule Path
Disallow

bytespider

Rule Path
Disallow

sosospider

Rule Path
Disallow

sogou web spider

Rule Path
Disallow

sogou inst spider

Rule Path
Disallow

sogou spider2

Rule Path
Disallow

sogou news spider

Rule Path
Disallow

sogou orion spider

Rule Path
Disallow

jikespider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

googlebot

Rule Path
Disallow

bingbot

Rule Path
Disallow

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

googlebot-mobile

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

psbot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap /sitemap.xml

Comments

  • ÷¼÷ÃÏÀ×îÇ¿robots Ö»ÔÊÐí°Ù¶È

Warnings

  • 2 invalid lines.