gp.townhall.com
robots.txt
Robots Exclusion Standard data for gp.townhall.com
Resource Scan
Scan Details
Site Domain | gp.townhall.com |
Base Domain | townhall.com |
Scan Status | Ok |
Last Scan | 2024-11-13T07:23:44+00:00 |
Next Scan | 2024-11-20T07:23:44+00:00 |
Last Scan
Scanned | 2024-11-13T07:23:44+00:00 |
URL | https://gp.townhall.com/robots.txt |
Redirect | https://townhall.com/robots.txt |
Redirect Domain | townhall.com |
Redirect Base | townhall.com |
Domain IPs | 104.18.12.37, 104.18.13.37, 2606:4700::6812:c25, 2606:4700::6812:d25 |
Redirect IPs | 104.18.12.37, 104.18.13.37, 2606:4700::6812:c25, 2606:4700::6812:d25 |
Response IP | 104.18.13.37 |
Found | Yes |
Hash | a14ea60483280d73a822cac6e1503722a3402872c3d6395a12eaaaf0af07291f |
SimHash | 8a0d98a0d932 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /podcastfeed/vip/ |
Disallow | /feed/v1/podcast/triggered-uncensored |
Disallow | /feed/v1/podcast/unredacted-with-kurt-schlichter |
Other Records
Field | Value |
---|---|
sitemap | https://townhall.com/sitemaps/sitemapindex-townhall.xml |
sitemap | https://townhall.com/tipsheet/sitemap.xml |