newsroot.in
robots.txt

Robots Exclusion Standard data for newsroot.in

Resource Scan

Scan Details

Site Domain newsroot.in
Base Domain newsroot.in
Scan Status Ok
Last Scan2024-09-21T00:35:27+00:00
Next Scan 2024-09-28T00:35:27+00:00

Last Scan

Scanned2024-09-21T00:35:27+00:00
URL https://newsroot.in/robots.txt
Redirect https://www.newsroot.in/robots.txt
Redirect Domain www.newsroot.in
Redirect Base newsroot.in
Domain IPs 23.213.158.17, 23.213.158.26
Redirect IPs 23.213.158.23, 23.213.158.4
Response IP 23.213.158.23
Found Yes
Hash b7dfda4a8dcd82db1c9facfe7366cfad8533f7472d4f5e3bd39b1d87957db919
SimHash 6b685cc8abb0

Groups

*
*

Rule Path
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /h-upload/
Disallow /wp-login.php
Disallow /wp-register.php

Other Records

Field Value
sitemap https://www.newsroot.in/sitemap.xml
sitemap https://www.newsroot.in/news-sitemap.xml
sitemap https://www.newsroot.in/sitemap_index.xml
sitemap https://www.newsroot.in/category-sitemap.xml
sitemap https://www.newsroot.in/image-sitemap.xml

Warnings

  • 4 invalid lines.