in.classi4u.com
robots.txt

Robots Exclusion Standard data for in.classi4u.com

Resource Scan

Scan Details

Site Domain	in.classi4u.com
Base Domain	classi4u.com
Scan Status	Ok
Last Scan	2026-01-22T12:48:28+00:00
Next Scan	2026-01-29T12:48:28+00:00

Last Scan

Scanned	2026-01-22T12:48:28+00:00
URL	https://in.classi4u.com/robots.txt
Domain IPs	35.190.86.134
Response IP	35.190.86.134
Found	Yes
Hash	3d707c17b592970d4789156171bc3a2c2b3ae16d68ee3d0869fe6e8d69c02592
SimHash	5f1ee17aa817

Groups

*

Rule	Path
Disallow	/content/
Disallow	/common/
Disallow	/classi/
Disallow	/xmldata/
Disallow	/forums/
Disallow	/videos/
Disallow	/api/
Disallow	/access/
Disallow	/actions/

crazywebcrawler-spider
zing-bottabot*
emailcollector
emailsiphon
emailwolf
crawlera
wget
nutch
kikbot
voltron
ahrefsbot
claudebot
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
gptbot
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
timpibot
velenpublicwebcrawler
youbot

Rule	Path
Disallow	/
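
The two groups above can be checked programmatically. The following is a minimal sketch, assuming the groups are reassembled into robots.txt text in the order shown in this report (the raw file itself is not reproduced here, so the exact layout is an assumption). It uses Python's standard urllib.robotparser; the helper names build_robots_lines and make_parser are hypothetical, and only a subset of the named user agents is included for brevity.

```python
from urllib.robotparser import RobotFileParser

# Paths disallowed for the "*" group, as listed in the scan report.
GENERAL_DISALLOWS = [
    "/content/", "/common/", "/classi/", "/xmldata/", "/forums/",
    "/videos/", "/api/", "/access/", "/actions/",
]

# A subset of the named user agents that share the "Disallow: /" group.
BLOCKED_AGENTS = ["wget", "ahrefsbot", "claudebot", "ccbot", "gptbot"]


def build_robots_lines():
    """Reassemble robots.txt lines from the reported groups (assumed layout)."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in GENERAL_DISALLOWS]
    lines.append("")  # a blank line separates the two groups
    lines += [f"User-agent: {agent}" for agent in BLOCKED_AGENTS]
    lines.append("Disallow: /")
    return lines


def make_parser():
    """Parse the reassembled rules without making any network request."""
    parser = RobotFileParser()
    parser.parse(build_robots_lines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    base = "https://in.classi4u.com"
    # A named crawler is blocked everywhere; other agents only on listed paths.
    print(parser.can_fetch("gptbot", base + "/"))
    print(parser.can_fetch("ExampleBrowser", base + "/api/items"))
    print(parser.can_fetch("ExampleBrowser", base + "/listing/123"))
```

Note that urllib.robotparser matches paths by simple prefix and user agents by case-insensitive substring, so its behavior may differ in detail from a given crawler's own parser.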

Other Records

Field	Value
sitemap	https://www.classi4u.com/sitemap-index.xml

Comments

Bot, bot, go away...
