cafetarikh.com
robots.txt
Robots Exclusion Standard data for cafetarikh.com
Resource Scan
Scan Details
| Site Domain | cafetarikh.com |
| Base Domain | cafetarikh.com |
| Scan Status | Ok |
| Last Scan | 2025-12-01T21:24:33+00:00 |
| Next Scan | 2025-12-31T21:24:33+00:00 |
Last Scan
| Scanned | 2025-12-01T21:24:33+00:00 |
| URL | https://cafetarikh.com/robots.txt |
| Redirect | https://www.cafetarikh.com/robots.txt |
| Redirect Domain | www.cafetarikh.com |
| Redirect Base | cafetarikh.com |
| Domain IPs | 87.107.133.152 |
| Redirect IPs | 87.107.133.152 |
| Response IP | 87.107.133.152 |
| Found | Yes |
| Hash | d30f8338cce4423ed272c89f4226365ff8fa412fc1562f1796186b4254cfdc4f |
| SimHash | e92d2dc4c9d3 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /cgi-bin/ |
| Disallow | /backup/ |
| Disallow | /cache/ |
| Disallow | /errors/ |
| Disallow | /includes/ |
| Disallow | /languages/ |
| Disallow | /logs/ |
| Disallow | /modules/ |
| Disallow | /publish/ |
| Disallow | /tasks/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.cafetarikh.com/sitemap-index.xml |