
Robots Exclusion Standard data for newsm.com

Resource Scan

Scan Details

Site Domain newsm.com
Base Domain newsm.com
Scan Status Ok
Last Scan 2024-06-01T05:49:30+00:00
Next Scan 2024-07-01T05:49:30+00:00

Last Scan

Scanned 2024-06-01T05:49:30+00:00
URL https://newsm.com/robots.txt
Redirect https://www.newsm.com/robots.txt
Redirect Domain www.newsm.com
Redirect Base newsm.com
Domain IPs 115.85.177.60, 118.67.141.100
Redirect IPs 115.85.177.60, 118.67.141.100
Response IP 118.67.141.100
Found Yes
Hash 1d46bc6ebf25daec0bae8b23bae743cda254f522d33d6e0fab7211a891b51dd9
SimHash 690d3800c291

Groups

*

Rule Path
Disallow /admin/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.newsm.com/sitemap.xml
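
Example: checking the rules above

The groups and records listed above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch, not the scanned file itself: the ROBOTS_TXT text is a reconstruction assumed from the scan data (the original line order, casing, and comments are unknown), and "ExampleBot" is a hypothetical user agent used only to exercise the * group.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the scan data above; the real file's
# exact layout, casing, and comments are not known.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/

User-agent: bingbot
Crawl-delay: 30

User-agent: gptbot
Disallow: /

Sitemap: https://www.newsm.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.newsm.com"

# "ExampleBot" is a hypothetical crawler that falls under the * group.
print(parser.can_fetch("ExampleBot", BASE + "/"))        # allowed by the * group
print(parser.can_fetch("ExampleBot", BASE + "/admin/"))  # blocked by Disallow /admin/

# bingbot has its own group with no rules, so every path is allowed,
# including /admin/, because a specific group replaces the * group.
print(parser.can_fetch("bingbot", BASE + "/admin/"))
print(parser.crawl_delay("bingbot"))

# gptbot is disallowed from the whole site.
print(parser.can_fetch("gptbot", BASE + "/"))

# Sitemap records apply to the whole file (site_maps() needs Python 3.8+).
print(parser.site_maps())
```

Note that a crawler matching a named group ignores the * group entirely, which is why bingbot is not bound by Disallow /admin/ in this sketch. Crawl-delay is a non-standard field, so individual crawlers may interpret or ignore it differently.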