happyhooligans.wordpress.com
robots.txt

Robots Exclusion Standard data for happyhooligans.wordpress.com

Resource Scan

Scan Details

Site Domain happyhooligans.wordpress.com
Base Domain wordpress.com
Scan Status Ok
Last Scan2026-01-08T01:52:28+00:00
Next Scan 2026-02-07T01:52:28+00:00

Last Scan

Scanned2026-01-08T01:52:28+00:00
URL https://happyhooligans.wordpress.com/robots.txt
Redirect https://happyhooligans.ca/robots.txt
Redirect Domain happyhooligans.ca
Redirect Base happyhooligans.ca
Domain IPs 192.0.78.12, 192.0.78.13
Redirect IPs 158.69.57.77
Response IP 158.69.57.77
Found Yes
Hash 550265bcb0e4893a9060e76a97b5c7d2e76db50c0a999bca3ea99a3fcccb610a
SimHash d905d843e192

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /search/
Disallow /search/*
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://happyhooligans.ca/sitemap_index.xml