headlineusa.com
robots.txt

Robots Exclusion Standard data for headlineusa.com

Resource Scan

Scan Details

Site Domain headlineusa.com
Base Domain headlineusa.com
Scan Status Ok
Last Scan 2024-06-01T16:26:59+00:00
Next Scan 2024-06-08T16:26:59+00:00

Last Scan

Scanned 2024-06-01T16:26:59+00:00
URL https://headlineusa.com/robots.txt
Domain IPs 104.18.0.244, 104.18.1.244, 2606:4700::6812:1f4, 2606:4700::6812:f4
Response IP 104.18.0.244
Found Yes
Hash 82cde2feca956cbcf4ea4a19629d359fc9e89216e2973a16bff2f6d6339aa649
SimHash d8615c408b92
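
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm, so treating it as SHA-256 of the raw response body is an assumption. Under that assumption, a minimal sketch for checking whether a saved copy of the file still matches the scanned version could look like this (the function names `content_hash` and `is_unchanged` are hypothetical):

```python
import hashlib

# Hash value as listed in the scan report.
REPORTED_HASH = "82cde2feca956cbcf4ea4a19629d359fc9e89216e2973a16bff2f6d6339aa649"


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of the raw robots.txt bytes.

    Assumption: the report hashes the unmodified response body with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a freshly computed digest with the reported one."""
    return content_hash(body) == reported.lower()
```

If the scanner normalizes the file before hashing (line endings, trailing whitespace), the digests will differ even for identical rules, so a mismatch alone does not prove the rules changed.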

Groups

User-agent: *

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /*?s=*

User-agent: *

Rule Path
Disallow /wp-content/uploads/wpo-plugins-tables-list.json
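
To illustrate how the rules above apply to a request path, here is a minimal sketch of a matcher. It merges both `*` groups into one rule list and treats `*` in a path as "any sequence of characters" and a trailing `$` as an end anchor, which is how wildcard-aware crawlers commonly interpret them. The names `pattern_to_regex` and `is_allowed` are hypothetical, and because the file contains only Disallow rules, the sketch skips the longest-match tie-breaking that Allow rules would require.

```python
import re

# Disallow paths from both "*" groups in the report, merged into one list.
DISALLOW = [
    "/wp-admin/",
    "/wp-login.php",
    "/*?s=*",
    "/wp-content/uploads/wpo-plugins-tables-list.json",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex.

    "*" matches any run of characters; a trailing "$" pins the end of the path.
    Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path: str, disallow=DISALLOW) -> bool:
    """Return True if no Disallow pattern matches the path (plus query string)."""
    return not any(pattern_to_regex(p).match(path) for p in disallow)


if __name__ == "__main__":
    for candidate in ["/wp-admin/options.php", "/?s=election", "/news/article"]:
        print(candidate, is_allowed(candidate))
```

Note that Python's standard `urllib.robotparser` does plain prefix matching and does not expand `*` inside paths, which is why the sketch builds its own regex for the `/*?s=*` rule.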

Other Records

Field Value
sitemap https://headlineusa.com/wp-sitemap.xml