heraldtimesonline.com
robots.txt

Robots Exclusion Standard data for heraldtimesonline.com

Resource Scan

Scan Details

Site Domain heraldtimesonline.com
Base Domain heraldtimesonline.com
Scan Status Ok
Last Scan 2024-06-01T20:22:24+00:00
Next Scan 2024-06-08T20:22:24+00:00

Last Scan

Scanned 2024-06-01T20:22:24+00:00
URL https://heraldtimesonline.com/robots.txt
Redirect https://www.heraldtimesonline.com/robots.txt
Redirect Domain www.heraldtimesonline.com
Redirect Base heraldtimesonline.com
Domain IPs 151.101.202.62
Redirect IPs 151.101.130.62, 151.101.194.62, 151.101.2.62, 151.101.66.62
Response IP 199.232.46.62
Found Yes
Hash d6384935edf73727cf2042008aa4bd26ba2d030f9d91357ef1bb5da768274959
SimHash b98c1375f7c0
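
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest; the report does not name the algorithm, so treating it as SHA-256 is an assumption. A minimal sketch of how a scanner might use such a digest to detect that a robots.txt body changed between scans follows. The function name `robots_digest` and the two sample bodies are hypothetical.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt body (SHA-256 is assumed here)."""
    return hashlib.sha256(body).hexdigest()


# Two hypothetical versions of a file, as fetched on consecutive scans.
previous_body = b"User-agent: *\nDisallow: /search\n"
current_body = b"User-agent: *\nDisallow: /search\nDisallow: /errors\n"

previous_digest = robots_digest(previous_body)
current_digest = robots_digest(current_body)

# Any byte-level difference yields a different digest, so an exact hash
# only answers "changed or not"; it says nothing about how much changed.
changed = previous_digest != current_digest
print(len(current_digest), changed)
```

An exact digest flips on any single-byte edit, which is presumably why the report also lists a separate SimHash value for approximate comparison.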

Groups

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /story/sponsor-story/
Disallow /picture-gallery/sponsor-story/
Disallow /videos/sponsor-story/
Disallow /longform/sponsor-story/
Disallow /pages/interactives/sponsor-story/
Disallow /interactives/sponsor-story/
Disallow /videos/embed/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

*

Rule Path
Disallow /errors
Disallow /interactive/
Disallow /userauth/
Disallow /ugc/
Disallow /feeds/
Disallow /services/
Disallow /facebook/
Disallow /version-info/
Disallow /longform/draft/
Disallow /story/draft/
Disallow /topic/*/smart/
Disallow /search
Disallow /module-showcase/
Disallow /newsletter/
Disallow /blended-newsletter/
Disallow /story/nletter/
Disallow /sports/services/photos/
Disallow /optimus
Disallow /ux-train
Disallow /story/advisory/
Disallow /.cam-tangent/
Disallow /pbd/
Disallow /gciaf/
Disallow /tncms/tracking/
Disallow /_services/
Disallow /tncms/search/
Disallow /tncms/openweb/*
Disallow /tncms/openid2/
Disallow /tncms/webservice/
Disallow /tncms/auth/
Disallow /tncms/admin/
Disallow /tncms/block/
Disallow /tncms/messaging/
Disallow /tncms/counter/
Disallow /tncms/gtm/
Disallow /tncms/media/
Disallow /tncms/disqus/
Disallow /tncms/user/
Disallow /users/admin/
Disallow /users/login/?
Disallow /users/signup/?
Disallow /marketplace/*action%3Dsrch
Disallow /tncms/calendar/
Disallow /calendar/search/
Disallow /calendar/art_exhibits/search/
Disallow /calendar/comedy/search/
Disallow /calendar/dance/search/
Disallow /calendar/films/search/
Disallow /calendar/music/search/
Disallow /calendar/reading/search/
Disallow /calendar/sports/search/
Disallow /calendar/talks_classes/search/
Disallow /calendar/theatre/search/
Disallow /herald_times_online/photo_assignment_calendar/search/
Disallow /herald_times_online/photo_assignment_calendar/jeremy_hogan/search/
Disallow /herald_times_online/photo_assignment_calendar/chris_howell/search/
Disallow /herald_times_online/photo_assignment_calendar/reporter/search/
Disallow /herald_times_online/photo_assignment_calendar/freelancer/search/
Disallow /calendar/art_events/search/
Disallow /calendar/trivia/search/
Disallow /calendar/festivities/search/
Disallow /calendar/kids/search/
Disallow /spencer_evening_world/calendar/search/
Disallow /calendar/meetings/search/
Disallow /calendar/fitness/search/
Disallow /classifieds/*?
Disallow /classifieds/cars/*?
Disallow /classifieds/cars/research/*?
Disallow /classifieds/cars/sell/*?
Disallow /classifieds/general/*?
Disallow /classifieds/jobs/*?
Disallow /classifieds/pets/*?
Disallow /classifieds/placeanad/*?
Disallow /classifieds/realestate/*?
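
The groups above can be exercised with Python's standard `urllib.robotparser`. The sketch below rebuilds a small excerpt of the file from the rules listed in this report (the `User-agent:` and `Disallow:` line syntax is reconstructed, since the report shows only the parsed form) and checks which group applies to which crawler. The agent name `SomeOtherBot` and the path `/story/news/` are hypothetical. Note that `urllib.robotparser` matches paths by plain prefix and does not interpret `*` inside a path, so wildcard rules such as `/topic/*/smart/` are left out of the excerpt.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the groups listed in this report.
ROBOTS_EXCERPT = """\
User-agent: gptbot
Disallow: /

User-agent: googlebot-news
Disallow: /story/sponsor-story/
Disallow: /videos/embed/

User-agent: *
Disallow: /errors
Disallow: /search
Disallow: /newsletter/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

BASE = "https://www.heraldtimesonline.com"

# gptbot has its own group that disallows everything.
gptbot_story = parser.can_fetch("GPTBot", BASE + "/story/news/")

# googlebot-news is governed only by its own group, not by the * group,
# so /search is allowed for it while sponsor stories are not.
gnews_sponsor = parser.can_fetch("Googlebot-News", BASE + "/story/sponsor-story/x")
gnews_search = parser.can_fetch("Googlebot-News", BASE + "/search")

# An agent with no named group falls back to the * group.
other_search = parser.can_fetch("SomeOtherBot", BASE + "/search")
other_story = parser.can_fetch("SomeOtherBot", BASE + "/story/news/")

print(gptbot_story, gnews_sponsor, gnews_search, other_search, other_story)
# → False False True False True
```

The point of the example is that a crawler with a named group ignores the `*` group entirely, so the long `*` list constrains only agents that are not named elsewhere in the file.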

Other Records

Field Value
sitemap https://www.heraldtimesonline.com/news-sitemap.xml
sitemap https://www.heraldtimesonline.com/web-sitemap-index.xml
sitemap https://www.heraldtimesonline.com/video-sitemap-index.xml
sitemap https://www.heraldtimesonline.com/obituaries/sitemap/index.xml
sitemap https://www.heraldtimesonline.com/sitemap.xml
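
The sitemap records above sit outside any user-agent group, so they apply regardless of which crawler reads the file. As a small sketch, `RobotFileParser.site_maps()` (available from Python 3.8) returns them as a list; the `Sitemap:` line syntax below is reconstructed from the field/value pairs in this report.

```python
from urllib.robotparser import RobotFileParser

# Sitemap lines reconstructed from the "Other Records" table.
SITEMAP_LINES = """\
Sitemap: https://www.heraldtimesonline.com/news-sitemap.xml
Sitemap: https://www.heraldtimesonline.com/web-sitemap-index.xml
Sitemap: https://www.heraldtimesonline.com/video-sitemap-index.xml
Sitemap: https://www.heraldtimesonline.com/obituaries/sitemap/index.xml
Sitemap: https://www.heraldtimesonline.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(SITEMAP_LINES.splitlines())

# site_maps() returns None when no Sitemap lines were found,
# so fall back to an empty list before iterating.
sitemaps = parser.site_maps() or []

for url in sitemaps:
    print(url)
```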

Comments

  • robots.txt file for https://www.heraldtimesonline.com/