themercury.com.au
robots.txt

Robots Exclusion Standard data for themercury.com.au

Resource Scan

Scan Details

Site Domain themercury.com.au
Base Domain themercury.com.au
Scan Status Ok
Last Scan 2024-06-04T23:49:22+00:00
Next Scan 2024-06-11T23:49:22+00:00

Last Scan

Scanned 2024-06-04T23:49:22+00:00
URL https://themercury.com.au/robots.txt
Redirect https://www.themercury.com.au/robots.txt
Redirect Domain www.themercury.com.au
Redirect Base themercury.com.au
Domain IPs 184.51.96.158
Redirect IPs 184.51.96.158
Response IP 184.51.96.158
Found Yes
Hash 73c3d94d093faf8cda60c4a00c9dc4d562d4992a92c848f75f46a9bc0da3388a
SimHash 502c5951e9d3

Groups

newsnow

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /*/comments-*
Disallow /404
Disallow /enewsletters/*
Disallow /doublerainbow/*
Disallow /it-test-only/*

Other Records

Field Value
sitemap https://www.themercury.com.au/sitemap.xml
sitemap https://www.themercury.com.au/news-sitemap.xml
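
The groups and sitemap records above can be checked programmatically. The following is a minimal sketch, assuming Python 3.8+ and its standard-library urllib.robotparser. The robots.txt text is rebuilt from the scan data shown here and is not the original file, so it will not reproduce the recorded hash. The agent name "examplebot" and the helper names build_robots_txt and is_allowed are hypothetical, chosen for illustration. The standard-library parser matches rule paths as plain prefixes, so the wildcard rules (such as /*/comments-*) are included in the rebuilt text but are not exercised in the checks.

```python
from urllib.robotparser import RobotFileParser

# Data copied from the scan report; the file layout itself is an assumption.
BLOCKED_AGENTS = [
    "newsnow", "ccbot", "gptbot", "chatgpt-user", "anthropic-ai",
    "cohere-ai", "ia_archiver", "mj12bot", "piplbot", "google-extended",
]
DEFAULT_RULES = [
    "/*/comments-*", "/404", "/enewsletters/*",
    "/doublerainbow/*", "/it-test-only/*",
]
SITEMAPS = [
    "https://www.themercury.com.au/sitemap.xml",
    "https://www.themercury.com.au/news-sitemap.xml",
]
BASE_URL = "https://www.themercury.com.au"


def build_robots_txt():
    """Rebuild an approximate robots.txt from the scanned groups (hypothetical helper)."""
    lines = []
    for agent in BLOCKED_AGENTS:
        # Each listed agent has a single rule: Disallow /
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append("User-agent: *")
    lines += [f"Disallow: {path}" for path in DEFAULT_RULES]
    lines.append("")
    lines += [f"Sitemap: {url}" for url in SITEMAPS]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())


def is_allowed(agent, path):
    """Return True if the rebuilt rules let `agent` fetch `path` (hypothetical helper)."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    for agent in ("GPTBot", "examplebot"):
        for path in ("/news/", "/404"):
            print(agent, path, is_allowed(agent, path))
    print(parser.site_maps())
```

Agent names are matched case-insensitively by this parser, which is why "GPTBot" falls under the gptbot group. A crawler that honours wildcard patterns would need a different matcher for the rules under the * group.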

Comments

  • Agent Specific Disallowed Sections