prabhasakshi.com
robots.txt

Robots Exclusion Standard data for prabhasakshi.com

Resource Scan

Scan Details

Site Domain prabhasakshi.com
Base Domain prabhasakshi.com
Scan Status Ok
Last Scan2024-06-06T02:11:33+00:00
Next Scan 2024-06-13T02:11:33+00:00

Last Scan

Scanned2024-06-06T02:11:33+00:00
URL https://prabhasakshi.com/robots.txt
Redirect https://www.prabhasakshi.com/robots.txt
Redirect Domain www.prabhasakshi.com
Redirect Base prabhasakshi.com
Domain IPs 104.26.4.32, 104.26.5.32, 172.67.70.128, 2606:4700:20::681a:420, 2606:4700:20::681a:520, 2606:4700:20::ac43:4680
Redirect IPs 104.26.4.32, 104.26.5.32, 172.67.70.128, 2606:4700:20::681a:420, 2606:4700:20::681a:520, 2606:4700:20::ac43:4680
Response IP 104.26.4.32
Found Yes
Hash e6501bdaf51ddaf518eb5b845ea6464cf574c6140debdfbb78f52aa93fd61e03
SimHash 4b04d0e75300

Groups

*

Rule Path
Allow /
Allow /topics/*.xml
Allow /topics/*.xml/*
Allow /topics/video/*.xml
Disallow */https%3A//cms2.prabhasakshi.com/*
Disallow /videos/*.html
Disallow /api/*
Disallow /archive/*

mediapartners-google

Rule Path
Allow /
Allow /topics/*.xml
Allow /topics/*.xml/*
Allow /topics/video/*.xml
Disallow */https%3A//cms2.prabhasakshi.com/*
Disallow /videos/*.html
Disallow /api/*
Disallow /archive/*

googlebot-news

Rule Path
Allow /
Disallow */https%3A//cms2.prabhasakshi.com/*
Disallow /videos/*.html
Disallow /api/*
Disallow /archive/*

googlebot

Rule Path
Allow /
Disallow */https%3A//cms2.prabhasakshi.com/*
Disallow /videos/*.html
Disallow /api/*
Disallow /archive/*

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.prabhasakshi.com/news-sitemap.xml
sitemap https://www.prabhasakshi.com/video-sitemap.xml
sitemap https://www.prabhasakshi.com/general-sitemap.xml
sitemap https://www.prabhasakshi.com/image-sitemap.xml
sitemap https://www.prabhasakshi.com/single-image-sitemap.xml
sitemap https://www.prabhasakshi.com/sitemap.xml

Comments

  • Sitemap archive