acadiananow.com
robots.txt

Robots Exclusion Standard data for acadiananow.com

Resource Scan

Scan Details

Site Domain acadiananow.com
Base Domain acadiananow.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-08-31T14:36:38+00:00
Next Scan 2024-11-29T14:36:38+00:00

Last Successful Scan

Scanned2022-04-13T13:11:17+00:00
URL http://acadiananow.com/robots.txt
Redirect https://www.theadvertiser.com/robots.txt
Redirect Domain www.theadvertiser.com
Redirect Base theadvertiser.com
Response IP 199.232.46.62
Found Yes
Hash a5bcecd25e2f1299aa05e201c3ab21de344848c578eb0f2689e5a0f95787c236
SimHash 2b8c1fc755f2

Groups

googlebot-news

Rule Path
Disallow /story/sponsor-story/
Disallow /picture-gallery/sponsor-story/
Disallow /videos/sponsor-story/
Disallow /longform/sponsor-story/
Disallow /pages/interactives/sponsor-story/
Disallow /interactives/sponsor-story/
Disallow /videos/embed/

*

Rule Path
Disallow /errors
Disallow /interactive/
Disallow /userauth/
Disallow /ugc/
Disallow /feeds/
Disallow /services/
Disallow /facebook/
Disallow /version-info/
Disallow /longform/draft/
Disallow /story/draft/
Disallow /topic/*/smart/
Disallow /search
Disallow /module-showcase/
Disallow /newsletter/
Disallow /blended-newsletter/
Disallow /story/nletter/
Disallow /sports/services/photos/
Disallow /optimus
Disallow /ux-train
Disallow /story/advisory/

Other Records

Field Value
sitemap https://www.theadvertiser.com/news-sitemap.xml
sitemap https://www.theadvertiser.com/web-sitemap-index.xml
sitemap https://www.theadvertiser.com/video-sitemap-index.xml

Comments

  • robots.txt file for https://www.theadvertiser.com/