in.classi4u.com
robots.txt

Robots Exclusion Standard data for in.classi4u.com

Resource Scan

Scan Details

Site Domain in.classi4u.com
Base Domain classi4u.com
Scan Status Ok
Last Scan2025-05-22T05:13:34+00:00
Next Scan 2025-05-29T05:13:34+00:00

Last Scan

Scanned2025-05-22T05:13:34+00:00
URL https://in.classi4u.com/robots.txt
Domain IPs 35.190.86.134
Response IP 35.190.86.134
Found Yes
Hash 3d707c17b592970d4789156171bc3a2c2b3ae16d68ee3d0869fe6e8d69c02592
SimHash 5f1ee17aa817

Groups

*

Rule Path
Disallow /content/
Disallow /common/
Disallow /classi/
Disallow /xmldata/
Disallow /forums/
Disallow /videos/
Disallow /api/
Disallow /access/
Disallow /actions/

crazywebcrawler-spider
zing-bottabot*
emailcollector
emailsiphon
emailwolf
crawlera
wget
nutch
kikbot
voltron
ahrefsbot
claudebot
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
gptbot
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
timpibot
velenpublicwebcrawler
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.classi4u.com/sitemap-index.xml

Comments

  • Bot, bot, go away...