newslanes.com
robots.txt
Robots Exclusion Standard data for newslanes.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newslanes.com |
Base Domain | newslanes.com |
Scan Status | Ok |
Last Scan | 2024-05-21T22:58:44+00:00 |
Next Scan | 2024-05-28T22:58:44+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-21T22:58:44+00:00 |
URL | https://newslanes.com/robots.txt |
Domain IPs | 2600:1f10:4ad3:5900:fb39:c8c3:59f3:8f1b, 34.193.38.8 |
Response IP | 34.193.38.8 |
Found | Yes |
Hash | fda03bb2e151e40e5bc188c96121c7c2a05fa2d48585d46ffbc0f2d85067c5fd |
SimHash | d9150fd4cf53 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | */trackback/ |
Disallow | */xmlrpc.php |
Disallow | /wp-*.php |
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Allow | */wp-content/uploads/ |
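
The rules in the `*` group use wildcards, so it may help to see how a crawler could evaluate them. The following is a minimal sketch, assuming the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan data; individual crawlers may resolve these rules differently.

```python
import re

# Rules copied from the "*" group in the table above: (rule type, path pattern).
RULES = [
    ("disallow", "*/trackback/"),
    ("disallow", "*/xmlrpc.php"),
    ("disallow", "/wp-*.php"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("allow", "*/wp-content/uploads/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path. Everything else is treated literally,
    and matching always starts at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on a tie, the allow rule wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/wp-login.php")` is false because `/wp-*.php` matches, while `is_allowed("/wp-content/uploads/photo.jpg")` is true because only the `Allow` rule matches. A path such as `/wp-content/uploads/x.php` matches both `/wp-*.php` and the longer `*/wp-content/uploads/`, so the `Allow` rule takes precedence in this sketch.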
Other Records
Field | Value |
---|---|
sitemap | https://newslanes.com/sitemap.xml |
sitemap | https://newslanes.com/sitemap-home.xml |
sitemap | https://newslanes.com/sitemap-news.xml |
sitemap | https://newslanes.com/sitemap-posts.xml |
sitemap | https://newslanes.com/sitemap-pages.xml |
sitemap | https://newslanes.com/sitemap-categories.xml |
sitemap | https://newslanes.com/sitemap-tags.xml |
sitemap | https://newslanes.com/sitemap-archives.xml |
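
As an illustration of how the sitemap records above could be read programmatically, here is a minimal sketch using Python's standard `urllib.robotparser` (the `site_maps()` method requires Python 3.8 or later). The `Sitemap:` lines are reconstructed from the table rather than copied from the original file, so their exact casing and order in the real robots.txt are an assumption, as are the names `SITEMAP_URLS` and `robots_lines`.

```python
from urllib.robotparser import RobotFileParser

# Sitemap URLs taken from the "Other Records" table above.
SITEMAP_URLS = [
    "https://newslanes.com/sitemap.xml",
    "https://newslanes.com/sitemap-home.xml",
    "https://newslanes.com/sitemap-news.xml",
    "https://newslanes.com/sitemap-posts.xml",
    "https://newslanes.com/sitemap-pages.xml",
    "https://newslanes.com/sitemap-categories.xml",
    "https://newslanes.com/sitemap-tags.xml",
    "https://newslanes.com/sitemap-archives.xml",
]

# Assumption: each record appeared in the file as a "Sitemap: <url>" line.
robots_lines = [f"Sitemap: {url}" for url in SITEMAP_URLS]

parser = RobotFileParser()
parser.parse(robots_lines)  # parse from memory; no network request is made

# site_maps() returns the listed sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
for url in sitemaps:
    print(url)
```

Parsing from an in-memory list keeps the example offline. Note that `urllib.robotparser` does not interpret `*` wildcards inside paths, so it is used here only to extract the sitemap records.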