news18marathi.com
robots.txt

Robots Exclusion Standard data for news18marathi.com

Resource Scan

Scan Details

Site Domain news18marathi.com
Base Domain news18marathi.com
Scan Status Ok
Last Scan2024-11-13T20:18:31+00:00
Next Scan 2024-11-20T20:18:31+00:00

Last Scan

Scanned2024-11-13T20:18:31+00:00
URL https://news18marathi.com/robots.txt
Domain IPs 23.215.7.12, 23.215.7.23, 2600:1413:b000:1b::17d7:70c, 2600:1413:b000:1b::17d7:717
Response IP 23.215.7.12
Found Yes
Hash 6a15da5f78e74d06bad03d5bcfa00db4bcfcf9ea6bbbc09bf55f9ccac50e4d04
SimHash ec8598476581

Groups

*

Rule Path
Allow /
Disallow /*/undefined/
Disallow /cricketnext/
Disallow /ganesh-chaturthi-festival/
Disallow /elections/winner/
Disallow /board-results-pubstack/
Disallow /elections/lok-sabha/orissa/
Disallow /elections/lok-sabha/chattisgarh/
Disallow /elections/lok-sabha-election-schedule
Disallow /elections/lok-sabha/nct-of-delhi/
Disallow /elections/winner/
Disallow /elections/*/assembly-election-all-winners-list/
Disallow /elections/*/assembly-election-result/
Disallow /amp/tag/tags-Q/
Disallow /amp/tag/gmail/
Disallow /amp/tag/tags-Z/
Disallow /photogallery/coronavirus-latest-news/
Disallow /amp/tag/pgstory/

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

mazbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://news18marathi.com/commonfeeds/v1/mar/sitemap-index.xml
sitemap https://news18marathi.com/commonfeeds/v1/mar/sitemap/today
sitemap https://news18marathi.com/commonfeeds/v1/mar/sitemap/google-news.xml
sitemap https://news18marathi.com/commonfeeds/v1/mar/sitemap-video-index.xml
sitemap https://news18marathi.com/commonfeeds/v1/mar/sitemap/webstories-sitemap-index.xml
sitemap https://news18marathi.com/commonfeeds/v1/mar/sitemap-image-index.xml