idou.me
robots.txt

Robots Exclusion Standard data for idou.me

Resource Scan

Scan Details

Site Domain idou.me
Base Domain idou.me
Scan Status Ok
Last Scan2024-11-12T09:56:27+00:00
Next Scan 2024-11-19T09:56:27+00:00

Last Scan

Scanned2024-11-12T09:56:27+00:00
URL https://idou.me/robots.txt
Domain IPs 13.113.166.218, 18.182.106.198
Response IP 13.113.166.218
Found Yes
Hash ad8c88dd11d2495a96e68456f3f50735de04f3d31938312862672c0e7f9ff6f9
SimHash e2dc4c05e35b

Groups

*

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

bingbot

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 5

criteobot/0.1

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 2

ahrefsbot

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 5

microadbot

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 5

dataforseobot

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 5

blexbot

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 5

y!j

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 5

imagesiftbot

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 10

facebookexternalhit

Rule Path
Disallow /external_link/
Disallow /clip
Disallow /history
Disallow /budget

Other Records

Field Value
crawl-delay 10

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://idou.me/sitemap/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /