humboldtinsider.com
robots.txt

Robots Exclusion Standard data for humboldtinsider.com

Resource Scan

Scan Details

Site Domain humboldtinsider.com
Base Domain humboldtinsider.com
Scan Status Ok
Last Scan2024-06-29T03:32:25+00:00
Next Scan 2024-07-06T03:32:25+00:00

Last Scan

Scanned2024-06-29T03:32:25+00:00
URL https://humboldtinsider.com/robots.txt
Redirect https://www.humboldtinsider.com/robots.txt
Redirect Domain www.humboldtinsider.com
Redirect Base humboldtinsider.com
Domain IPs 104.21.14.75, 172.67.158.46, 2606:4700:3030::6815:e4b, 2606:4700:3037::ac43:9e2e
Redirect IPs 104.21.14.75, 172.67.158.46, 2606:4700:3030::6815:e4b, 2606:4700:3037::ac43:9e2e
Response IP 104.21.14.75
Found Yes
Hash f6cc9b39dbcc34db0f1b3eae47dbecaa4d904a298be263090e1387f6a0fa2887
SimHash 7171fc03c8f3

Groups

*

Rule Path
Disallow /humboldt/ArticleArchives
Disallow /humboldt/CommentArchives
Disallow /humboldt/EventSearch
Disallow /humboldt/ImageArchives
Disallow /humboldt/FilmSearch
Disallow /humboldt/LocationSearch
Disallow /humboldt/MemberSearch
Disallow /humboldt/MovieTimes
Disallow /humboldt/Search
Disallow /humboldt/SlideshowArchives
Disallow /humboldt/VideoArchives

Other Records

Field Value
sitemap https://www.humboldtinsider.com/humboldt/Sitemap.xml