allrugby.com
robots.txt

Robots Exclusion Standard data for allrugby.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	allrugby.com
Base Domain	allrugby.com
Scan Status	Ok
Last Scan	2024-10-29T23:36:11+00:00
Next Scan	2024-11-05T23:36:11+00:00

Last Scan

Scanned	2024-10-29T23:36:11+00:00
URL	https://allrugby.com/robots.txt
Redirect	https://www.allrugby.com/robots.txt
Redirect Domain	www.allrugby.com
Redirect Base	allrugby.com
Domain IPs	193.70.63.11
Redirect IPs	193.70.63.11
Response IP	193.70.63.11
Found	Yes
Hash	a26f95bd7e68c7d1f3ec1fecff3668105e4bc715b5983eb91c5237a20a4b520f
SimHash	70375951c1c4

Groups

pompos

Rule	Path
Disallow	/

Rule

Path

Disallow

/

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

quepasacreep

Rule	Path
Disallow	/

Rule

Path

Disallow

/

jetbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ai2bot
ai2bot-dolma
amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
google-extended
googleother
googleother-image
googleother-video
icc-crawler
imagesiftbot
meta-externalagent
meta-externalfetcher
oai-searchbot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
iaskspider/2.0
img2dataset
omgili
omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.allrugby.com/sitemap.xml

Field

Value

sitemap

https://www.allrugby.com/sitemap.xml

Back to top

allrugby.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

pompos

turnitinbot

quepasacreep

jetbot

Other Records

allrugby.com
robots.txt