newsheadline.net
robots.txt

Robots Exclusion Standard data for newsheadline.net

Resource Scan

Scan Details

Site Domain newsheadline.net
Base Domain newsheadline.net
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-10-15T11:15:33+00:00
Next Scan 2025-01-13T11:15:33+00:00

Last Successful Scan

Scanned 2024-02-26T11:10:40+00:00
URL https://newsheadline.net/robots.txt
Domain IPs 104.21.46.16, 172.67.222.137, 2606:4700:3031::6815:2e10, 2606:4700:3031::ac43:de89
Response IP 172.67.222.137
Found Yes
Hash 926926cd8b1c7861b2367d9637887d9ca07230abef3b6d545fd454af704f9e3b
SimHash e1019a22cfb2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
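
The two rules in the * group overlap: the Allow path sits inside the Disallowed directory. A minimal sketch of how a crawler following RFC 9309 precedence (longest matching path wins, Allow wins ties) might evaluate them is shown here. The names RULES and is_allowed are hypothetical, and the sketch uses plain prefix matching only, with no support for the * and $ wildcards.

```python
# Rules from the "*" group above, as (directive, path-prefix) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence (RFC 9309 style): the longest matching prefix
    wins, Allow wins a tie, and a path with no matching rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue  # rule does not apply to this path
        is_allow = directive == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


if __name__ == "__main__":
    for p in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/news/"):
        print(p, is_allowed(p))
```

Under these assumptions, /wp-admin/admin-ajax.php is crawlable because its Allow rule is the longer match, while everything else under /wp-admin/ stays blocked. Parsers that apply rules in file order instead of by length can reach a different answer for the same file.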

Other Records

Field Value
sitemap https://newsheadline.net/sitemap.xml
sitemap https://newsheadline.net/google-news.xml
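
The sitemap records above can be read programmatically. The sketch below assumes the file looked roughly like the ROBOTS_TXT string, which is reconstructed from the fields in this report and is not the original file's exact text. It uses Python's standard urllib.robotparser (site_maps() requires Python 3.8 or later).

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report fields; the real file's exact bytes,
# ordering, and comments are unknown, so treat this as an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://newsheadline.net/sitemap.xml
Sitemap: https://newsheadline.net/google-news.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap records exist, so fall back
# to an empty list to keep iteration simple.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```

Parsing a local string avoids any network request, which matters here because the most recent scan could not connect to the server.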