newsso.in
robots.txt

Robots Exclusion Standard data for newsso.in

Resource Scan

Scan Details

Site Domain newsso.in
Base Domain newsso.in
Scan Status Ok
Last Scan3/15/2025, 1:03:00 PM
Next Scan 3/22/2025, 1:03:00 PM

Last Scan

Scanned3/15/2025, 1:03:00 PM
URL https://newsso.in/robots.txt
Domain IPs 104.21.14.42, 172.67.157.183, 2606:4700:3034::ac43:9db7, 2606:4700:3037::6815:e2a
Response IP 104.21.14.42
Found Yes
Hash be8052db27782dce0daea5dc8f8fa388e02adf563effa3e5186632195595cae5
SimHash 49448c75c7b1

Groups

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://newsso.in/sitemap.xml

Warnings

  • `host` is not a known field.