puzzleiran.com
robots.txt

Robots Exclusion Standard data for puzzleiran.com

Resource Scan

Scan Details

Site Domain puzzleiran.com
Base Domain puzzleiran.com
Scan Status Ok
Last Scan2025-08-29T04:53:03+00:00
Next Scan 2025-09-28T04:53:03+00:00

Last Scan

Scanned2025-08-29T04:53:03+00:00
URL https://puzzleiran.com/robots.txt
Domain IPs 89.235.79.4
Response IP 89.235.79.4
Found Yes
Hash aee4454467fecb9751e9d27a50250faaaf1246fb6ac166f5a0bdb1456fda61d1
SimHash c1001c33cd92

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /calendar/action~posterboard/
Disallow /calendar/action~agenda/
Disallow /calendar/action~oneday/
Disallow /calendar/action~month/
Disallow /calendar/action~week/
Disallow /calendar/action~stream/
Disallow /calendar/action~undefined/
Disallow /calendar/action~http%3A/
Disallow /calendar/action~default/
Disallow /calendar/action~poster/
Disallow /calendar/action~*/
Disallow /*controller%3Dai1ec_exporter_controller*
Disallow /*/action~*/

Other Records

Field Value
sitemap https://puzzleiran.com/sitemap.xml
sitemap sitemap.xml