www.staging2.horseracingnation.com
robots.txt

Robots Exclusion Standard data for www.staging2.horseracingnation.com

Resource Scan

Scan Details

Site Domain www.staging2.horseracingnation.com
Base Domain horseracingnation.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2024-05-26T08:24:18+00:00
Next Scan 2024-06-25T08:24:18+00:00

Last Successful Scan

Scanned2024-04-04T08:12:27+00:00
URL https://www.staging2.horseracingnation.com/robots.txt
Domain IPs 23.20.16.252, 44.197.33.128, 44.216.6.24, 54.224.30.57
Response IP 44.216.6.24
Found Yes
Hash bdaeea083f7c09e6e699c11df340ba48c7aa43dff86a27660ec754ac00da363f
SimHash 30905dc1a464

Groups

*

Rule Path
Disallow /edit
Disallow /forgotpassword.aspx
Disallow /Login.aspx
Disallow /login.aspx
Disallow /signup.aspx
Disallow /session
Disallow /terms

amazonbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • FAANG
  • Google Bard AI
  • Microsoft / OpenAI
  • Others
  • Common Crawl