newsjunction.in
robots.txt

Robots Exclusion Standard data for newsjunction.in

Resource Scan

Scan Details

Site Domain newsjunction.in
Base Domain newsjunction.in
Scan Status Ok
Last Scan2026-03-12T12:49:21+00:00
Next Scan 2026-03-19T12:49:21+00:00

Last Scan

Scanned2026-03-12T12:49:21+00:00
URL https://www.newsjunction.in/robots.txt
Domain IPs 142.251.10.121, 2404:6800:4003:c02::79
Response IP 142.251.10.121
Found Yes
Hash 4f52f39ed3259e02d11f8cd7feaf08b9f26cc5c60b1da730dc2a490217785ea1
SimHash 0914da504f53

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Disallow /share-widget
Allow /

Other Records

Field Value
sitemap https://www.newsjunction.in/sitemap.xml