theirishcurse.com
robots.txt

Robots Exclusion Standard data for theirishcurse.com

Resource Scan

Scan Details

Site Domain theirishcurse.com
Base Domain theirishcurse.com
Scan Status Ok
Last Scan 2026-01-05T14:03:10+00:00
Next Scan 2026-02-04T14:03:10+00:00

Last Scan

Scanned 2026-01-05T14:03:10+00:00
URL https://theirishcurse.com/robots.txt
Domain IPs 104.21.34.46, 172.67.197.235, 2606:4700:3034::ac43:c5eb, 2606:4700:3036::6815:222e
Response IP 172.67.197.235
Found Yes
Hash e428de13317cf5cd1d58c8b57b1714654aa8cfefe366a26f6f65a736485672ab
SimHash 00151df12f90

Groups

*

Rule Path
Disallow /cgi-bin
Disallow *?s=*
Disallow */trackback
Disallow */feed
Disallow */rss
Disallow */go.php*
Disallow */go/*
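
To illustrate how the wildcard rules in this group might be applied to a request path, here is a minimal sketch in Python. It assumes the common Robots Exclusion Protocol semantics: a pattern matches from the start of the path (including any query string), `*` matches any run of characters, and a trailing `$` anchors the end. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and are not part of the scan output; a real crawler may differ in details such as percent-encoding and Allow precedence.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/cgi-bin",
    "*?s=*",
    "*/trackback",
    "*/feed",
    "*/rss",
    "*/go.php*",
    "*/go/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end;
    every other character (including '?') is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    # re.match anchors at the beginning, which gives prefix matching.
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for candidate in ["/cgi-bin/test", "/?s=whiskey", "/blog/post/feed", "/about"]:
        print(candidate, is_disallowed(candidate))
```

Because the group contains only Disallow rules, the sketch does not need to resolve Allow/Disallow conflicts. Note that prefix matching means `*/feed` would also match a path such as `/feedback` under these assumptions.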

Other Records

Field Value
sitemap https://www.theirishcurse.com/sitemap_index.xml