dawgpost.com
robots.txt

Robots Exclusion Standard data for dawgpost.com

Resource Scan

Scan Details

Site Domain dawgpost.com
Base Domain dawgpost.com
Scan Status Ok
Last Scan2024-11-08T07:26:59+00:00
Next Scan 2024-11-15T07:26:59+00:00

Last Scan

Scanned2024-11-08T07:26:59+00:00
URL https://dawgpost.com/robots.txt
Domain IPs 40.119.40.202
Response IP 40.119.40.202
Found Yes
Hash 405e8d86b45a8a9819c00e24649dc10f8dd1c4496fec2f74708f4a880b144baa
SimHash f90124cd9515

Groups

*

Rule Path
Disallow /modules/foundationarticles
Disallow /news/link/
Disallow /modules/sportsblock
Disallow /modules/maroonbookspotlight
Disallow /forums/_post
Disallow /store/_store_addtocart
Disallow /store/_store_items
Disallow /store/_skeleton
Disallow /store/_store_items_list
Disallow /store/_store_updatecart
Disallow /account/validate
Disallow /home/small
Disallow /forums/_potdpost
Disallow /recruiting/_sidecommits
Disallow /premium/feedpartial
Disallow /premium/_feedinner
Disallow /players/_recruitupdate
Disallow /media/customize
Disallow /media/playerdata
Disallow /players/_recruitupdate
Disallow /newsletter/unsubscribe
Disallow /account/profile

Other Records

Field Value
sitemap http://dawgpost.com/sitemap.xml