in.classi4u.com
robots.txt
Robots Exclusion Standard data for in.classi4u.com
Resource Scan
Scan Details
Site Domain | in.classi4u.com |
Base Domain | classi4u.com |
Scan Status | Ok |
Last Scan | 2025-05-22T05:13:34+00:00 |
Next Scan | 2025-05-29T05:13:34+00:00 |
Last Scan
Scanned | 2025-05-22T05:13:34+00:00 |
URL | https://in.classi4u.com/robots.txt |
Domain IPs | 35.190.86.134 |
Response IP | 35.190.86.134 |
Found | Yes |
Hash | 3d707c17b592970d4789156171bc3a2c2b3ae16d68ee3d0869fe6e8d69c02592 |
SimHash | 5f1ee17aa817 |
Groups
*
Rule | Path |
---|---|
Disallow | /content/ |
Disallow | /common/ |
Disallow | /classi/ |
Disallow | /xmldata/ |
Disallow | /forums/ |
Disallow | /videos/ |
Disallow | /api/ |
Disallow | /access/ |
Disallow | /actions/ |
crazywebcrawler-spider
zing-bottabot*
emailcollector
emailsiphon
emailwolf
crawlera
wget
nutch
kikbot
voltron
ahrefsbot
claudebot
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
gptbot
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
timpibot
velenpublicwebcrawler
youbot
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.classi4u.com/sitemap-index.xml |
Comments