staging2.horseracingnation.com
robots.txt
Robots Exclusion Standard data for staging2.horseracingnation.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | staging2.horseracingnation.com |
Base Domain | horseracingnation.com |
Scan Status | Ok |
Last Scan | 2024-11-09T04:28:52+00:00 |
Next Scan | 2024-11-16T04:28:52+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-09T04:28:52+00:00 |
URL | https://staging2.horseracingnation.com/robots.txt |
Redirect | https://www.staging2.horseracingnation.com:443/robots.txt |
Redirect Domain | www.staging2.horseracingnation.com |
Redirect Base | horseracingnation.com |
Domain IPs | 18.210.67.184, 3.223.229.93, 34.201.199.117, 34.225.231.135 |
Redirect IPs | 18.210.67.184, 3.223.229.93, 34.201.199.117, 34.225.231.135 |
Response IP | 18.210.67.184 |
Found | Yes |
Hash | d8ce5baa8e04b0917cab5c2a6c0cefe5a46d84f178bef2932533437942ee7309 |
SimHash | 305a4bc1c764 |
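The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The scan does not state which algorithm or which bytes were hashed, so the following is only a sketch under that assumption; `sha256_hex` is a hypothetical helper name, and the sample body is illustrative rather than the real file.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex.

    Assumption: the scanner hashes the raw robots.txt bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Illustrative body only; it is not the content served by the site.
    sample = b"User-agent: *\nDisallow: /edit\n"
    digest = sha256_hex(sample)
    print(len(digest), digest)
```

Comparing a freshly computed digest against the stored Hash would show whether the file changed between scans, provided the assumption about the algorithm holds.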
Groups
*
Rule | Path |
---|---|
Disallow | /edit |
Disallow | /forgotpassword.aspx |
Disallow | /Login.aspx |
Disallow | /login.aspx |
Disallow | /signup.aspx |
Disallow | /session |
Disallow | /terms |
ai2bot
ai2bot-dolma
amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
google-extended
googleother
googleother-image
googleother-video
icc-crawler
imagesiftbot
meta-externalagent
meta-externalfetcher
oai-searchbot
perplexitybot
petalbot
scrapy
the knowledge ai
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
iaskspider/2.0
img2dataset
omgili
omgilibot
Rule | Path |
---|---|
Disallow | / |
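The two groups above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch, assuming the file consists of exactly the rules listed here: the robots.txt text is rebuilt from the tables, not fetched from the site, and the `/races` path and the `Mozilla` agent string are hypothetical test values.

```python
from urllib.robotparser import RobotFileParser

# Paths disallowed for every crawler (the "*" group).
GENERAL_DISALLOW = [
    "/edit", "/forgotpassword.aspx", "/Login.aspx", "/login.aspx",
    "/signup.aspx", "/session", "/terms",
]

# User agents that share the single "Disallow: /" rule.
BLOCKED_AGENTS = [
    "ai2bot", "ai2bot-dolma", "amazonbot", "applebot", "applebot-extended",
    "bytespider", "ccbot", "chatgpt-user", "claude-web", "claudebot",
    "diffbot", "facebookbot", "friendlycrawler", "gptbot", "google-extended",
    "googleother", "googleother-image", "googleother-video", "icc-crawler",
    "imagesiftbot", "meta-externalagent", "meta-externalfetcher",
    "oai-searchbot", "perplexitybot", "petalbot", "scrapy",
    "the knowledge ai", "timpibot", "velenpublicwebcrawler",
    "webzio-extended", "youbot", "anthropic-ai", "cohere-ai",
    "facebookexternalhit", "iaskspider/2.0", "img2dataset", "omgili",
    "omgilibot",
]

BASE = "https://www.staging2.horseracingnation.com"


def build_robots_txt() -> str:
    """Rebuild a robots.txt equivalent to the two groups in the report."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in GENERAL_DISALLOW]
    lines.append("")  # a blank line separates the groups
    lines += [f"User-agent: {agent}" for agent in BLOCKED_AGENTS]
    lines.append("Disallow: /")
    return "\n".join(lines) + "\n"


def make_parser() -> RobotFileParser:
    """Parse the rebuilt rules without making any network request."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    for agent, path in [("Mozilla", "/races"), ("Mozilla", "/edit"),
                        ("GPTBot", "/races")]:
        verdict = "allowed" if parser.can_fetch(agent, BASE + path) else "blocked"
        print(f"{agent:8} {path:8} {verdict}")
```

Rules are matched by path prefix and are case-sensitive, which may be why both `/Login.aspx` and `/login.aspx` are listed. Other parsers can differ from `urllib.robotparser` in how they match agent names, so results for a specific crawler should be confirmed against that crawler's own documentation.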