thebirdman.org
robots.txt

Robots Exclusion Standard data for thebirdman.org

Resource Scan

Scan Details

Site Domain thebirdman.org
Base Domain thebirdman.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-31T12:47:58+00:00
Next Scan 2025-12-30T12:47:58+00:00

Last Successful Scan

Scanned2025-08-10T00:33:06+00:00
URL https://thebirdman.org/robots.txt
Domain IPs 104.21.76.138, 172.67.195.192, 2606:4700:3032::ac43:c3c0, 2606:4700:3034::6815:4c8a
Response IP 172.67.195.192
Found Yes
Hash 8909d848d1d305e30452491e10654df9244936906ebee26d7e369e1032322193
SimHash 3c1179526380

Groups

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /
Disallow /

Comments

  • NOTICE: The collection of content and other data on this
  • site through automated means, including any device, tool,
  • or process designed to data mine or scrape content, is
  • prohibited except (1) for the purpose of search engine indexing or
  • artificial intelligence retrieval augmented generation or (2) with express
  • written permission from this site’s operator.
  • To request permission to license our intellectual
  • property and/or other materials, please contact this
  • site’s operator directly.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content

Warnings

  • 1 invalid line.
  • `!duser-agent` is not a known field.