aroostooknews.com
robots.txt
Robots Exclusion Standard data for aroostooknews.com
Resource Scan
Scan Details
Site Domain | aroostooknews.com |
Base Domain | aroostooknews.com |
Scan Status | Ok |
Last Scan | 2024-11-18T07:39:29+00:00 |
Next Scan | 2024-11-25T07:39:29+00:00 |
Last Scan
Scanned | 2024-11-18T07:39:29+00:00 |
URL | https://aroostooknews.com/robots.txt |
Domain IPs | 34.236.176.60 |
Response IP | 3.82.80.110 |
Found | Yes |
Hash | bb6d4cb249ac52f6e576e2d0fccc292b0ae2b10ad52656c21b8e214c878a0574 |
SimHash | 6e16d004c735 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /*.json |
Disallow | /v.js |
Disallow | /ad.js |
Other Records
Field | Value |
---|---|
sitemap | https://aroostooknews.com/sitemaps/aroostooknews/sitemap.xml.gz |
sitemap | https://aroostooknews.com/sitemaps/aroostooknews/sitemap_news.xml.gz |
Comments