newsx.com
robots.txt

Robots Exclusion Standard data for newsx.com

Resource Scan

Scan Details

Site Domain newsx.com
Base Domain newsx.com
Scan Status Ok
Last Scan 2024-06-07T10:54:56+00:00
Next Scan 2024-06-14T10:54:56+00:00

Last Scan

Scanned 2024-06-07T10:54:56+00:00
URL https://newsx.com/robots.txt
Redirect https://www.newsx.com/robots.txt
Redirect Domain www.newsx.com
Redirect Base newsx.com
Domain IPs 139.84.177.81
Redirect IPs 139.84.177.81
Response IP 139.84.177.81
Found Yes
Hash 8fe829f5bd0ae6f1dbb5bca971aae12d2fc96686d03101fb2a31fa6d03ca4274
SimHash 234913955521

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

No rules defined. All paths allowed.

msnbot

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /
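
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. It rebuilds robots.txt text from the groups listed in this report, so it is a reconstruction and not the original file (the report does not show the invalid line mentioned under Warnings). The names GROUPS, build_robots_txt and TEST_URL are illustrative assumptions, and the article path in TEST_URL is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Groups as listed in this report. The empty rule list mirrors the
# "No rules defined" entry for googlebot-image.
GROUPS = {
    "*": [("Allow", "/")],
    "googlebot": [("Allow", "/")],
    "googlebot-mobile": [("Allow", "/")],
    "googlebot-news": [("Allow", "/")],
    "mediapartners-google": [("Allow", "/")],
    "adsbot-google": [("Allow", "/")],
    "googlebot-image": [],
    "msnbot": [("Allow", "/")],
    "bingbot": [("Allow", "/")],
    "slurp": [("Allow", "/")],
}


def build_robots_txt(groups):
    """Render the groups as robots.txt text (a reconstruction, not the original file)."""
    blocks = []
    for agent, rules in groups.items():
        lines = [f"User-agent: {agent}"]
        lines += [f"{rule}: {path}" for rule, path in rules]
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks) + "\n"


parser = RobotFileParser()
parser.parse(build_robots_txt(GROUPS).splitlines())

# Hypothetical article path, used only to exercise the rules.
TEST_URL = "https://www.newsx.com/world/example-story"

results = {agent: parser.can_fetch(agent, TEST_URL) for agent in GROUPS}
for agent, allowed in results.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Since every listed rule is Allow / and googlebot-image has no rules at all, each user agent in this sketch should be reported as allowed. Other parsers may resolve group matching differently, but with no Disallow rules present the outcome should be the same.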

Other Records

Field Value
sitemap https://www.newsx.com/sitemap-entertainment.xml
sitemap https://www.newsx.com/sitemap-gadgets.xml
sitemap https://www.newsx.com/sitemap-auto.xml
sitemap https://www.newsx.com/sitemap-national.xml
sitemap https://www.newsx.com/sitemap-world.xml
sitemap https://www.newsx.com/news-sitemap.xml
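
The sitemap records can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available). The SITEMAP_LINES text is rebuilt from the records listed above and is not a copy of the original file.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records as listed in this report, written in robots.txt syntax.
SITEMAP_LINES = """\
Sitemap: https://www.newsx.com/sitemap-entertainment.xml
Sitemap: https://www.newsx.com/sitemap-gadgets.xml
Sitemap: https://www.newsx.com/sitemap-auto.xml
Sitemap: https://www.newsx.com/sitemap-national.xml
Sitemap: https://www.newsx.com/sitemap-world.xml
Sitemap: https://www.newsx.com/news-sitemap.xml
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES.splitlines())

# site_maps() returns None when no sitemap records were found.
sitemaps = sitemap_parser.site_maps() or []
for url in sitemaps:
    print(url)
```

Sitemap records are independent of user-agent groups, so they are collected regardless of where they appear in the file.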

Warnings

  • 1 invalid line.