Robots Exclusion Standard data for ngrguardiannews.com

Resource Scan

Scan Details

Site Domain ngrguardiannews.com
Base Domain ngrguardiannews.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-11-07T21:20:56+00:00
Next Scan 2025-02-05T21:20:56+00:00

Last Successful Scan

Scanned 2024-07-11T21:13:36+00:00
URL http://ngrguardiannews.com/robots.txt
Redirect http://guardian.ng/robots.txt
Redirect Domain guardian.ng
Redirect Base guardian.ng
Domain IPs 35.186.215.69
Redirect IPs 34.120.183.76
Response IP 34.120.183.76
Found Yes
Hash da81873c85a6abb26ca21628ab13b3e4e35f7e2d9841bf5604ae35f4fc89ce8d
SimHash 084cc840a153

Groups

*

Rule Path
Disallow (empty path)

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://guardian.ng/sitemap_index.xml
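The groups and sitemap record above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch, not the fetched file: `ROBOTS_TXT` is reconstructed from the fields in this report, so its casing, ordering, and blank lines are assumptions. The helper `is_allowed`, the agent name `ExampleBot`, and the path `/news/` are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups, records, and comments listed in this report.
# Casing, ordering, and blank lines are assumptions, not the original bytes.
ROBOTS_TXT = """\
# START YOAST BLOCK
# ---------------------------
User-agent: *
Disallow:

User-agent: ccbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: gptbot
Disallow: /

Sitemap: http://guardian.ng/sitemap_index.xml
# ---------------------------
# END YOAST BLOCK
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the reconstructed rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "/news/" is a hypothetical path chosen for illustration.
    url = "http://guardian.ng/news/"
    for agent in ("ExampleBot", "CCBot", "Google-Extended", "GPTBot"):
        print(agent, is_allowed(agent, url))
    print(parser.site_maps())
```

An empty `Disallow` value blocks nothing, so agents that match only the `*` group may fetch any path, while the three named crawlers are blocked from the whole site. `site_maps()` requires Python 3.8 or later.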

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK