news.smh.com.au
robots.txt

Robots Exclusion Standard data for news.smh.com.au

Resource Scan

Scan Details

Site Domain news.smh.com.au
Base Domain smh.com.au
Scan Status Ok
Last Scan 2024-11-13T17:01:17+00:00
Next Scan 2024-11-20T17:01:17+00:00

Last Scan

Scanned 2024-11-13T17:01:17+00:00
URL https://news.smh.com.au/robots.txt
Redirect https://www.smh.com.au/robots.txt
Redirect Domain www.smh.com.au
Redirect Base smh.com.au
Domain IPs 13.248.160.137, 75.2.43.150, 76.223.34.124, 99.83.186.106
Redirect IPs 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133, 2a04:4e42:200::645, 2a04:4e42:400::645, 2a04:4e42:600::645, 2a04:4e42::645
Response IP 199.232.46.133
Found Yes
Hash 7db8c19b6fbddf3d86e39ac72954e297c18d74b693346464ea9f239affe057e7
SimHash f451415bc7a7

Groups

*

Rule Path
Allow /
Disallow /search?text=*
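
The two rules in the `*` group overlap, so which one applies depends on the matching rule a crawler uses. The sketch below assumes the longest-match behaviour described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and treats `*` as a wildcard. The helper names `pattern_to_regex` and `is_allowed` are hypothetical and not part of the scan data; individual crawlers may evaluate rules differently.

```python
import re

def pattern_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored-at-start regex."""
    # A trailing '$' anchors the pattern to the end of the path.
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path: str) -> bool:
    """Apply longest-match: the longest matching pattern decides the outcome."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for verb, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = verb.lower() == "allow"
            # Prefer longer patterns; on equal length prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed

# The rules listed for the '*' group in this scan.
WILDCARD_GROUP = [("Allow", "/"), ("Disallow", "/search?text=*")]

print(is_allowed(WILDCARD_GROUP, "/search?text=budget"))  # False
print(is_allowed(WILDCARD_GROUP, "/national/"))           # True
```

Under these assumptions, ordinary pages stay crawlable for unnamed agents while on-site search result URLs are excluded.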

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /
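
The per-agent groups above all carry the same single rule, so their effect can be checked with a short script. The following is a minimal sketch using Python's standard `urllib.robotparser`; it rebuilds a robots.txt body from a subset of the groups in this scan rather than fetching the live file. `BLOCKED_AGENTS`, `build_robots_txt`, and the `ExampleBrowser` agent name are assumptions made for illustration. Only the root path is checked, because the standard-library parser applies rules in file order and does not appear to expand `*` inside paths.

```python
from urllib.robotparser import RobotFileParser

# A subset of the agent names listed in the groups above (illustrative).
BLOCKED_AGENTS = ["anthropic-ai", "claudebot", "gptbot", "ccbot", "perplexitybot"]

def build_robots_txt(blocked_agents):
    """Rebuild a robots.txt body: the '*' group, then one Disallow group per agent."""
    lines = ["User-agent: *", "Allow: /", "Disallow: /search?text=*", ""]
    for agent in blocked_agents:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return "\n".join(lines)

parser = RobotFileParser()
parser.parse(build_robots_txt(BLOCKED_AGENTS).splitlines())

# A named agent is refused the whole site; an unnamed one falls back to '*'.
print(parser.can_fetch("GPTBot", "https://www.smh.com.au/"))          # False
print(parser.can_fetch("ExampleBrowser", "https://www.smh.com.au/"))  # True
```

To test against the live file instead, `RobotFileParser.set_url()` followed by `read()` could replace the `parse()` call, subject to the site's terms of use.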

Other Records

Field Value
sitemap https://www.smh.com.au/sitemaps/news/brands/smh
sitemap https://www.smh.com.au/sitemaps/smh-sitemaps-videos.xml
sitemap https://www.smh.com.au/sitemaps/smh-navigation-pages.xml
sitemap https://www.smh.com.au/sitemaps/smh-sitemaps-articles.xml
sitemap https://www.smh.com.au/rss/feed.xml

Comments

  • Nine Entertainment Co expressly prohibits the use of any Nine content or data, including associated metadata, for any machine learning and/or artificial intelligence including for the purposes of training or development of AI technology, tools and machine learning language models.
  • view our terms of use - https://login.nine.com.au/terms?client_id=smh
  • Sitemaps
  • All visitors
  • Specific agents