theafricareport.com
robots.txt

Robots Exclusion Standard data for theafricareport.com

Resource Scan

Scan Details

Site Domain theafricareport.com
Base Domain theafricareport.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-20T00:42:38+00:00
Next Scan 2025-01-18T00:42:38+00:00

Last Successful Scan

Scanned2024-03-24T20:59:29+00:00
URL https://theafricareport.com/robots.txt
Redirect https://www.theafricareport.com/robots.txt
Redirect Domain www.theafricareport.com
Redirect Base theafricareport.com
Domain IPs 104.18.0.210, 104.18.1.210, 2606:4700::6812:1d2, 2606:4700::6812:d2
Redirect IPs 104.18.0.210, 104.18.1.210, 2606:4700::6812:1d2, 2606:4700::6812:d2
Response IP 104.18.1.210
Found Yes
Hash 36a62704fdc02507049a0207c0f08447f5a89cfea0cb77c98610893bdc71785a
SimHash c3155d975265

Groups

*

Rule Path
Allow /

grapeshot

Rule Path
Disallow

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

googlebot

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.swf$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$

googlebot-image

Rule Path
Disallow
Allow /*

googlebot-image

Rule Path
Allow /cdn-cgi/image/

mediapartners-google*

Rule Path
Disallow
Allow /*

Other Records

Field Value
sitemap https://www.theafricareport.com/sitemap/general.xml
sitemap https://www.theafricareport.com/sitemap/googlenews.xml

Warnings

  • 13 invalid lines.