cafetarikh.com
robots.txt

Robots Exclusion Standard data for cafetarikh.com

Resource Scan

Scan Details

Site Domain cafetarikh.com
Base Domain cafetarikh.com
Scan Status Ok
Last Scan2025-12-01T21:24:33+00:00
Next Scan 2025-12-31T21:24:33+00:00

Last Scan

Scanned2025-12-01T21:24:33+00:00
URL https://cafetarikh.com/robots.txt
Redirect https://www.cafetarikh.com/robots.txt
Redirect Domain www.cafetarikh.com
Redirect Base cafetarikh.com
Domain IPs 87.107.133.152
Redirect IPs 87.107.133.152
Response IP 87.107.133.152
Found Yes
Hash d30f8338cce4423ed272c89f4226365ff8fa412fc1562f1796186b4254cfdc4f
SimHash e92d2dc4c9d3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /backup/
Disallow /cache/
Disallow /errors/
Disallow /includes/
Disallow /languages/
Disallow /logs/
Disallow /modules/
Disallow /publish/
Disallow /tasks/

Other Records

Field Value
sitemap https://www.cafetarikh.com/sitemap-index.xml