odia.news18.com
robots.txt

Robots Exclusion Standard data for odia.news18.com

Resource Scan

Scan Details

Site Domain odia.news18.com
Base Domain news18.com
Scan Status Ok
Last Scan2024-05-02T04:01:27+00:00
Next Scan 2024-06-01T04:01:27+00:00

Last Scan

Scanned2024-05-02T04:01:27+00:00
URL https://odia.news18.com/robots.txt
Domain IPs 23.52.114.70, 2600:1413:b000:38c::3379, 2600:1413:b000:390::3379
Response IP 23.54.58.50
Found Yes
Hash a8ee19281fcff7aadd99e20e04fd8e7c3ba3821cebb7a5d87571e62977345ff2
SimHash 683959508dc7

Groups

*

Rule Path
Allow /
Disallow /api/
Disallow /includes/
Disallow /uc/
Disallow /SolrPhpClient/
Disallow /1039154/
Disallow /*page-1*page-1$
Disallow /scorecard/
Disallow /byline/*/page-*/
Disallow /notifications/
Disallow /articles-sitemap
Disallow /image-sitemap
Disallow /vod/
Disallow /news/page-*/
Disallow /videos/page-*/
Disallow /videos-sitemap
Disallow /videos/*/page-*/
Disallow /news/page-*/*/page-*/
Disallow /*/category/*/
Disallow /byline/*/videos/
Disallow /bypoll-2021/
Disallow /tag/*/news/
Disallow /tag/*/photogallery/
Disallow /tag/*/videos/
Disallow /1
Disallow /2
Disallow /3
Disallow /4
Disallow /6
Disallow /7
Disallow /8
Disallow /9
Disallow /0

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

mazbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://odia.news18.com/commonfeeds/v1/odi/sitemap/today
sitemap https://odia.news18.com/commonfeeds/v1/odi/sitemap-index.xml
sitemap https://odia.news18.com/commonfeeds/v1/odi/sitemap/google-news.xml
sitemap https://odia.news18.com/commonfeeds/v1/odi/sitemap-image-index.xml
sitemap https://odia.news18.com/commonfeeds/v1/odi/sitemap-video-index.xml
sitemap https://odia.news18.com/commonfeeds/v1/odi/sitemap/webstories-sitemap-index.xml

Warnings

  • 1 invalid line.