blog.studentcaffe.com
robots.txt

Robots Exclusion Standard data for blog.studentcaffe.com

Resource Scan

Scan Details

Site Domain blog.studentcaffe.com
Base Domain studentcaffe.com
Scan Status Ok
Last Scan2025-05-05T04:12:53+00:00
Next Scan 2025-06-04T04:12:53+00:00

Last Scan

Scanned2025-05-05T04:12:53+00:00
URL http://blog.studentcaffe.com/robots.txt
Domain IPs 192.124.249.9
Response IP 192.124.249.9
Found Yes
Hash 353047b2353461a81f39df8217b46502f13a967118748214c1c446e0f9a20bd7
SimHash 4104cc80ad97

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-content/uploads/ad*
Allow /wp-admin/admin-ajax.php

googlebot-image

Rule Path
Disallow /wp-content/uploads/sites/
Disallow /wp-content/uploads/
Allow /wp-content/uploads/studentcaffe*

Other Records

Field Value
sitemap http://studentcaffe.com/sitemap.xml
sitemap http://blog.studentcaffe.com/sitemap.xml