thehinducentre.com
robots.txt

Robots Exclusion Standard data for thehinducentre.com

Resource Scan

Scan Details

Site Domain thehinducentre.com
Base Domain thehinducentre.com
Scan Status Ok
Last Scan2024-11-04T06:56:45+00:00
Next Scan 2024-11-11T06:56:45+00:00

Last Scan

Scanned2024-11-04T06:56:45+00:00
URL https://thehinducentre.com/robots.txt
Redirect https://www.thehinducentre.com/robots.txt
Redirect Domain www.thehinducentre.com
Redirect Base thehinducentre.com
Domain IPs 104.18.84.224, 104.18.85.224, 2606:4700::6812:54e0, 2606:4700::6812:55e0
Redirect IPs 104.18.84.224, 104.18.85.224, 2606:4700::6812:54e0, 2606:4700::6812:55e0
Response IP 104.18.85.224
Found Yes
Hash 176c685428965a0ac4b21d7d715bd937923e942e050eed4a833a5ba3dcd1ab41
SimHash a9327d426781

Groups

*

Rule Path
Disallow /?type=commentReceipt
Disallow /cgi-bin/
Disallow /cdn-cgi/*
Disallow /config/
Disallow /nic/
Disallow /search/*
Disallow /search/
Disallow /SEARCH/
Disallow /Search/
Disallow /newsletter/
Disallow /newsletter/*
Disallow /config/*
Disallow /*?date=*
Disallow */analysis-logger/*
Disallow /todayspaper/
Disallow /today-paper/
Disallow /todays-paper/
Disallow /world-news-day/
Disallow */wf.fragment/*
Disallow */article30471298.ece/amp/*
Disallow */article9636950.ece/amp/*
Disallow */article30358181.ece/amp/*
Disallow */article30389945.ece/amp/*
Disallow */article30483913.ece/amp/*
Disallow */article30483913.ece/amp/*
Disallow *ref%3D*
Disallow *textsize%3D*
Disallow *test%3D*
Disallow *css%3D*
Disallow */?_ptid=*
Disallow /archive/print/
Disallow /companies/announcements/*
Allow /?service=googlenews
Allow /?service=newssitemap
Allow /todays-paper/*/alternates/*
Disallow *?_ptid=*
Disallow *%26_ptid%3D*
Disallow */?*&page=*
Disallow */?page=*&*

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thehindubusinessline.com/sitemap/googlenews.xml
sitemap https://www.thehindubusinessline.com/sitemap/update.xml
sitemap https://www.thehindubusinessline.com/sitemap/archive.xml

Comments

  • Disallow: /static/
  • Block all paginations except topics temporarily until CUE
  • Disallow ChatGPT from extracting or interpreting our content