west.newsnetmedia.com
robots.txt

Robots Exclusion Standard data for west.newsnetmedia.com

Resource Scan

Scan Details

Site Domain west.newsnetmedia.com
Base Domain newsnetmedia.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-07-02T16:53:43+00:00
Next Scan 2024-09-30T16:53:43+00:00

Last Successful Scan

Scanned2023-03-11T13:41:48+00:00
URL https://west.newsnetmedia.com/robots.txt
Domain IPs 104.18.30.13, 104.18.31.13, 2606:4700::6812:1e0d, 2606:4700::6812:1f0d
Response IP 104.18.31.13
Found Yes
Hash cc392cccc2d02384cbb330ef71dd5428bbe104a85505db606647842709bb69bf
SimHash 4d1d5f4c38db

Groups

*

Rule Path
Disallow /ads/
Disallow /global/tools/
Disallow /global/interfaces/
Disallow /global/images/
Disallow /global/include/
Disallow /global/applications/
Disallow /global/pm/
Disallow /global/utilities/
Disallow /global/reports/
Disallow /global/video/
Disallow /applications/
Disallow /cgi-bin/
Disallow /classifieds/
Disallow /default_files/
Disallow /images/
Disallow /include/
Disallow /incoming/
Disallow /reports/
Disallow /professionalservices/
Disallow /search
Disallow /temp/
Disallow /trafficcam/
Disallow /traffic/
Disallow /contentmgmt/
Disallow /link/
Disallow /register
Disallow /login
Disallow /forgot-password
Disallow /reset-password
Disallow /profile

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://west.newsnetmedia.com/sitemap.xml.gz
sitemap https://west.newsnetmedia.com/sitemap-pages.xml.gz
sitemap https://west.newsnetmedia.com/newssitemap.xml.gz
sitemap https://west.newsnetmedia.com/videositemap.xml.gz