/.well-known/

ngrguardiannews.com
robots.txt

Robots Exclusion Standard data for ngrguardiannews.com

Resource Scan

Scan Details

Site Domain	ngrguardiannews.com
Base Domain	ngrguardiannews.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-11-07T21:20:56+00:00
Next Scan	2025-02-05T21:20:56+00:00

Last Successful Scan

Scanned	2024-07-11T21:13:36+00:00
URL	http://ngrguardiannews.com/robots.txt
Redirect	http://guardian.ng/robots.txt
Redirect Domain	guardian.ng
Redirect Base	guardian.ng
Domain IPs	35.186.215.69
Redirect IPs	34.120.183.76
Response IP	34.120.183.76
Found	Yes
Hash	da81873c85a6abb26ca21628ab13b3e4e35f7e2d9841bf5604ae35f4fc89ce8d
SimHash	084cc840a153
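
The recorded hash is 64 hexadecimal characters long, which is consistent with SHA-256, though the report does not name the algorithm or say which bytes were hashed. As a rough sketch under that assumption (SHA-256 over the raw response body), a later copy of the file could be compared against this snapshot as follows. The function name `matches_snapshot` is hypothetical and not part of any tool mentioned here.

```python
import hashlib

# Hash value copied from the "Last Successful Scan" record above.
RECORDED_HASH = "da81873c85a6abb26ca21628ab13b3e4e35f7e2d9841bf5604ae35f4fc89ce8d"


def matches_snapshot(body: bytes, recorded_hash: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `recorded_hash`.

    Assumption: the report's hash is SHA-256 over the raw response body.
    If the scanner normalizes the content first, this check will not match.
    """
    return hashlib.sha256(body).hexdigest() == recorded_hash.lower()
```

A mismatch would only show that the bytes differ under this assumption; it would not say which rules changed.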

Groups

*

Rule	Path
Disallow	(empty)

ccbot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/
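
To see how these groups behave in practice, the sketch below rebuilds a robots.txt from the rules listed above and checks a few crawlers against it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption: it reproduces the reported groups with the user-agent tokens as the scan shows them, not the original file bytes, so comments, ordering, and casing may differ from what the server sent. The helper name `build_parser` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the reported groups; NOT the original file contents.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
Disallow:

User-agent: ccbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: gptbot
Disallow: /
"""


def build_parser(text: str = RECONSTRUCTED_ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    url = "http://guardian.ng/"
    for agent in ("Googlebot", "CCBot", "Google-Extended", "GPTBot"):
        verdict = "allowed" if parser.can_fetch(agent, url) else "blocked"
        print(f"{agent}: {verdict}")
```

Under this reconstruction, the empty `Disallow` in the `*` group permits everything for crawlers without a group of their own, while the three named agents are blocked from the whole site.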


Other Records

Field	Value
sitemap	http://guardian.ng/sitemap_index.xml


Comments

START YOAST BLOCK
---------------------------
---------------------------
END YOAST BLOCK
