yiff.life
robots.txt

Robots Exclusion Standard data for yiff.life

Resource Scan

Scan Details

Site Domain yiff.life
Base Domain yiff.life
Scan Status Ok
Last Scan2024-10-04T02:59:07+00:00
Next Scan 2024-10-05T02:59:07+00:00

Last Scan

Scanned2024-10-04T02:59:07+00:00
URL https://yiff.life/robots.txt
Domain IPs 104.26.4.121, 104.26.5.121, 172.67.71.46, 2606:4700:20::681a:479, 2606:4700:20::681a:579, 2606:4700:20::ac43:472e
Response IP 104.26.5.121
Found Yes
Hash 146a8bced4d611b51e23b57008b1a67918dceb80721c4c58623852aa47c97cd4
SimHash a2a52c8d6140

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/

ia_archiver

Rule Path
Disallow /

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /