Robots Exclusion Standard data for topparrain.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | topparrain.com |
Base Domain | topparrain.com |
Scan Status | Ok |
Last Scan | 2024-09-20T23:30:47+00:00 |
Next Scan | 2024-09-27T23:30:47+00:00 |
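The two timestamps above are exactly seven days apart, which suggests a weekly rescan. The sketch below reproduces that arithmetic; the seven-day interval is inferred from this one report, and the name `next_scan` is hypothetical rather than part of any scanner API.

```python
from datetime import datetime, timedelta

# Assumption: a fixed weekly interval, inferred from the Last Scan and
# Next Scan values in this report rather than from scanner documentation.
SCAN_INTERVAL = timedelta(days=7)


def next_scan(last_scan_iso: str, interval: timedelta = SCAN_INTERVAL) -> str:
    """Return the ISO 8601 timestamp one interval after the last scan."""
    last = datetime.fromisoformat(last_scan_iso)
    return (last + interval).isoformat()


print(next_scan("2024-09-20T23:30:47+00:00"))
```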
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-20T23:30:47+00:00 |
URL | https://topparrain.com/robots.txt |
Redirect | https://www.topparrain.com/robots.txt |
Redirect Domain | www.topparrain.com |
Redirect Base | topparrain.com |
Domain IPs | 198.185.159.144, 198.185.159.145 |
Redirect IPs | 108.128.72.146, 54.216.252.255, 54.73.26.109 |
Response IP | 54.73.26.109 |
Found | Yes |
Hash | dbf956095a0e9235de8d6ff522158a5ba7e485715431a00b06ea350af0ae88b0 |
SimHash | 6f87bc70ec13 |
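The Hash value is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names are hypothetical, and a mismatch against a freshly fetched file could mean either that the file changed or that the assumption is wrong.

```python
import hashlib

# Hash value copied from the scan report.
REPORTED_HASH = "dbf956095a0e9235de8d6ff522158a5ba7e485715431a00b06ea350af0ae88b0"


def sha256_hex(body: bytes) -> str:
    """Hex-encoded SHA-256 digest of the given bytes."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body against the reported digest.

    Assumption: the scanner hashes the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == reported.lower()
```

To use it, pass the bytes of the robots.txt response exactly as received, before any decoding or newline normalization.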
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /ads/preferences/ |
Allow | /gpt/ |
Allow | /pagead/show_ads.js |
Allow | /pagead/js/adsbygoogle.js |
Allow | /pagead/js/*/show_ads_impl.js |
Allow | /static/glade.js |
Allow | /static/glade/ |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | https://topparrain.s3.us-east-2.amazonaws.com/sitemaps/sitemap.xml.gz |
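The records listed in this report can be checked with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text below is a reconstruction from the tables in this report, not the original file: the real file's line order, comments, and spacing are unknown. Note also that `urllib.robotparser` matches paths by prefix and does not expand the `*` wildcard in `/pagead/js/*/show_ads_impl.js`, and that `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the scan report, not copied from the site.
ROBOTS_TXT = """\
User-agent: *
Allow: /ads/preferences/
Allow: /gpt/
Allow: /pagead/show_ads.js
Allow: /pagead/js/adsbygoogle.js
Allow: /pagead/js/*/show_ads_impl.js
Allow: /static/glade.js
Allow: /static/glade/
Crawl-delay: 1
Sitemap: https://topparrain.s3.us-east-2.amazonaws.com/sitemaps/sitemap.xml.gz
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# The group contains only Allow rules, so no path is actually blocked.
print(rules.can_fetch("*", "https://www.topparrain.com/gpt/"))
print(rules.can_fetch("*", "https://www.topparrain.com/any/other/page"))
print(rules.crawl_delay("*"))
print(rules.site_maps())
```

Because the report lists no Disallow rules, every URL is permitted for every user agent; the only constraint a polite crawler would pick up here is the one-second crawl delay.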