plainsman.com
robots.txt

Robots Exclusion Standard data for plainsman.com

Resource Scan

Scan Details

Site Domain plainsman.com
Base Domain plainsman.com
Scan Status Ok
Last Scan 2024-10-01T15:16:20+00:00
Next Scan 2024-10-08T15:16:20+00:00

Last Scan

Scanned 2024-10-01T15:16:20+00:00
URL https://plainsman.com/robots.txt
Redirect https://www.plainsman.com/robots.txt
Redirect Domain www.plainsman.com
Redirect Base plainsman.com
Domain IPs 65.61.154.7
Redirect IPs 65.61.154.7
Response IP 65.61.154.7
Found Yes
Hash 0a208d2a3e0d0487ff7cb4d54c1b68504dd012b466d3b96c00b460738b70ecf1
SimHash 8473d9d6e9d3

Groups

*

Rule Path
Disallow /css/
Disallow /css_system/
Disallow /js/
Disallow /js_system/
Disallow /account/
Disallow /calendar/post/
Disallow /forms/
Disallow /login.html
Disallow /poll_process.html
Disallow /post_comments.html
Disallow /my_profile.html
Disallow /my_stuff.html
Disallow /user_profile.html
Disallow /ajax/
Disallow /register.html
Disallow /report_item.html
Disallow /send_item.html
Disallow /subscribe/
Disallow /renew/
Disallow /account/
Disallow /resetpassword/
Disallow /reset/
Disallow /alacarte/
Disallow /lookup/
Disallow /register-local/
Disallow /entercode/
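
The rules in the "*" group can be checked against a URL with Python's standard urllib.robotparser. The following is a minimal sketch, not the site's own tooling: ROBOTS_LINES is a hand-reconstructed, abbreviated subset of the group above, and the agent name "ExampleBot" and the helper is_allowed are hypothetical names chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: an abbreviated reconstruction of the "*" group listed above,
# not the full robots.txt as served by the site.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /css/",
    "Disallow: /account/",
    "Disallow: /login.html",
    "Disallow: /subscribe/",
]

# Parse the rules once; parse() accepts an iterable of lines.
_parser = RobotFileParser()
_parser.parse(ROBOTS_LINES)


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the rules above.

    `agent` is a hypothetical crawler name; it falls through to the "*" group.
    """
    return _parser.can_fetch(agent, "https://www.plainsman.com" + path)


if __name__ == "__main__":
    for path in ("/login.html", "/account/settings", "/news/"):
        print(path, is_allowed(path))
```

Matching in this parser is by path prefix, so /account/settings is blocked by the /account/ rule while a path that appears in no rule, such as /news/, is allowed.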

Other Records

Field Value
sitemap http://plainsman.staging.communityq.com/sitemaps/sitemaps-r2-brookings-huron-1.xml
sitemap https://www.plainsman.com/sitemaps/sitemaps-r2-default-huron-1.xml
sitemap https://www.plainsman.com/sitemaps/sitemaps-r2-googlenews-huron-1.xml
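
The first sitemap record points at a different host and uses http, unlike the other two. A short sketch of how such records might be flagged follows; the helper is_on_base_domain is a hypothetical name, and the check is a simple suffix comparison against the Base Domain value reported above, not a full public-suffix lookup.

```python
from urllib.parse import urlsplit

BASE_DOMAIN = "plainsman.com"  # "Base Domain" value from the scan details

# The three sitemap records listed above, copied verbatim.
SITEMAPS = [
    "http://plainsman.staging.communityq.com/sitemaps/sitemaps-r2-brookings-huron-1.xml",
    "https://www.plainsman.com/sitemaps/sitemaps-r2-default-huron-1.xml",
    "https://www.plainsman.com/sitemaps/sitemaps-r2-googlenews-huron-1.xml",
]


def is_on_base_domain(url, base=BASE_DOMAIN):
    """True if the URL's host is the base domain or one of its subdomains."""
    host = urlsplit(url).hostname or ""
    return host == base or host.endswith("." + base)


# Records whose host falls outside the base domain, and records not using https.
off_site = [u for u in SITEMAPS if not is_on_base_domain(u)]
not_https = [u for u in SITEMAPS if urlsplit(u).scheme != "https"]

if __name__ == "__main__":
    for url in off_site:
        print("off-site sitemap:", url)
    for url in not_https:
        print("non-https sitemap:", url)
```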