0471168.com
robots.txt

Robots Exclusion Standard data for 0471168.com

Resource Scan

Scan Details

Site Domain 0471168.com
Base Domain 0471168.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-04-24T02:43:51+00:00
Next Scan 2024-07-23T02:43:51+00:00

Last Successful Scan

Scanned2023-03-09T02:41:24+00:00
URL http://0471168.com/robots.txt
Domain IPs 122.10.73.142
Response IP 122.10.73.142
Found Yes
Hash 9b36205619dc9d3a7d15cdbe7949eed5c6cdc0cffa5fca8cfad81f4263b56ebf
SimHash 685c59936683

Groups

baiduspider

Rule Path
Disallow

sosospider

Rule Path
Disallow

sogou spider

Rule Path
Disallow

yisouspider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

googlebot-mobile

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

psbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

researchscan

Rule Path
Disallow /

dnyzbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Disallow /index.php

Other Records

Field Value
sitemap /rss/sitemap.xml
sitemap /rss/sitemap.txt
sitemap /rss/baidu.html

Warnings

  • 2 invalid lines.