classi4u.com
robots.txt
Robots Exclusion Standard data for classi4u.com
Resource Scan
Scan Details
| Site Domain | classi4u.com |
| Base Domain | classi4u.com |
| Scan Status | Ok |
| Last Scan | 2025-12-20T09:32:07+00:00 |
| Next Scan | 2025-12-27T09:32:07+00:00 |
Last Scan
| Scanned | 2025-12-20T09:32:07+00:00 |
| URL | https://classi4u.com/robots.txt |
| Redirect | https://www.classi4u.com/robots.txt |
| Redirect Domain | www.classi4u.com |
| Redirect Base | classi4u.com |
| Domain IPs | 35.190.86.134 |
| Redirect IPs | 35.190.86.134 |
| Response IP | 35.190.86.134 |
| Found | Yes |
| Hash | 3d707c17b592970d4789156171bc3a2c2b3ae16d68ee3d0869fe6e8d69c02592 |
| SimHash | 5f1ee17aa817 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /content/ |
| Disallow | /common/ |
| Disallow | /classi/ |
| Disallow | /xmldata/ |
| Disallow | /forums/ |
| Disallow | /videos/ |
| Disallow | /api/ |
| Disallow | /access/ |
| Disallow | /actions/ |
crazywebcrawler-spider
zing-bottabot*
emailcollector
emailsiphon
emailwolf
crawlera
wget
nutch
kikbot
voltron
ahrefsbot
claudebot
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
gptbot
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
timpibot
velenpublicwebcrawler
youbot
| Rule | Path |
|---|---|
| Disallow | / |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.classi4u.com/sitemap-index.xml |
Comments