topparrain.com
robots.txt

Robots Exclusion Standard data for topparrain.com

Resource Scan

Scan Details

Site Domain topparrain.com
Base Domain topparrain.com
Scan Status Ok
Last Scan2024-09-20T23:30:47+00:00
Next Scan 2024-09-27T23:30:47+00:00

Last Scan

Scanned2024-09-20T23:30:47+00:00
URL https://topparrain.com/robots.txt
Redirect https://www.topparrain.com/robots.txt
Redirect Domain www.topparrain.com
Redirect Base topparrain.com
Domain IPs 198.185.159.144, 198.185.159.145
Redirect IPs 108.128.72.146, 54.216.252.255, 54.73.26.109
Response IP 54.73.26.109
Found Yes
Hash dbf956095a0e9235de8d6ff522158a5ba7e485715431a00b06ea350af0ae88b0
SimHash 6f87bc70ec13

Groups

*

Rule Path
Allow /ads/preferences/
Allow /gpt/
Allow /pagead/show_ads.js
Allow /pagead/js/adsbygoogle.js
Allow /pagead/js/*/show_ads_impl.js
Allow /static/glade.js
Allow /static/glade/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://topparrain.s3.us-east-2.amazonaws.com/sitemaps/sitemap.xml.gz