newsbreak.ng
robots.txt

Robots Exclusion Standard data for newsbreak.ng

Resource Scan

Scan Details

Site Domain newsbreak.ng
Base Domain newsbreak.ng
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-08-16T00:21:10+00:00
Next Scan 2024-11-14T00:21:10+00:00

Last Successful Scan

Scanned2024-03-27T00:19:24+00:00
URL https://newsbreak.ng/robots.txt
Domain IPs 104.21.26.92, 172.67.135.210, 2606:4700:3031::6815:1a5c, 2606:4700:3034::ac43:87d2
Response IP 104.21.26.92
Found Yes
Hash 19ec11edc15085c0f645c25b0eac58d1afdac467fe3843b22c317df443700bde
SimHash 797d4e7141f1

Groups

*

Rule Path
Disallow */trackback/
Disallow */xmlrpc.php
Disallow /wp-*.php
Disallow /cgi-bin/
Disallow /wp-admin/
Allow */wp-content/uploads/

Other Records

Field Value
sitemap https://newsbreak.ng/sitemap.xml
sitemap https://newsbreak.ng/sitemap-home.xml
sitemap https://newsbreak.ng/sitemap-news.xml
sitemap https://newsbreak.ng/sitemap-posts.xml
sitemap https://newsbreak.ng/sitemap-pages.xml
sitemap https://newsbreak.ng/sitemap-categories.xml
sitemap https://newsbreak.ng/sitemap-tags.xml
sitemap https://newsbreak.ng/sitemap-archives.xml
sitemap https://newsbreak.ng/sitemap-custom-taxonomies.xml
sitemap https://newsbreak.ng/sitemap-attachment.xml

Comments

  • Squirrly SEO Robots