allafrica.com
robots.txt

Robots Exclusion Standard data for allafrica.com

Resource Scan

Scan Details

Site Domain allafrica.com
Base Domain allafrica.com
Scan Status Ok
Last Scan 2024-11-13T12:42:46+00:00
Next Scan 2024-11-20T12:42:46+00:00

Last Scan

Scanned 2024-11-13T12:42:46+00:00
URL https://allafrica.com/robots.txt
Domain IPs 173.203.36.104
Response IP 173.203.36.104
Found Yes
Hash 5f3a21d1b5252f4ef37cd7e0bca51f66dc7de15aeb0842150873e33db69d6368
SimHash 69089067c011

Groups

*

Rule Path
Disallow /stories/printable/
Disallow /misc/forms/
Disallow /misc/error/
Disallow /user/
Disallow /newsletter/
Disallow /commerce/
Disallow /comments/new/
Disallow /comments/abusive/
Disallow /create/
Disallow /report/
Disallow /abusive/
Disallow /search/advanced.html
Disallow /search/adv_search.html
Disallow /static/images/misc/s-trans.gif
Disallow /tools/apache/
Disallow /tools/mason/
Disallow /tools/qmail/
Disallow /thread/comment/
Disallow /search/
Disallow /_wb$
Disallow /includes/
Disallow *.json$

Other Records

Field Value
crawl-delay 1
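
Several paths in the group above use the `*` wildcard and the `$` end anchor (`/_wb$`, `*.json$`), which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns could be matched against a URL path. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical, only a subset of the listed rules is included, and Allow rules and longest-match precedence are deliberately ignored because this group contains only Disallow entries.

```python
import re

# A subset of the Disallow paths listed for the "*" group above.
DISALLOW = [
    "/stories/printable/",
    "/user/",
    "/search/",
    "/_wb$",
    "*.json$",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any rule matches from the start of the path."""
    return any(pattern_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, `is_disallowed("/search/advanced.html")` and `is_disallowed("/misc/feed.json")` are both true, while `is_disallowed("/_wb/page")` is false because `/_wb$` only matches the exact path `/_wb`.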

blp_bbot/0.1

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mediapartners-google

Rule Path
Allow /

bingbot

Rule Path
Disallow /stories/printable/
Disallow /misc/forms/
Disallow /misc/error/
Disallow /user/
Disallow /newsletter/
Disallow /commerce/
Disallow /comments/new/
Disallow /comments/abusive/
Disallow /create/
Disallow /report/
Disallow /abusive/
Disallow /search/advanced.html
Disallow /search/adv_search.html
Disallow /static/images/misc/s-trans.gif
Disallow /tools/apache/
Disallow /tools/mason/
Disallow /tools/qmail/
Disallow /thread/comment/
Disallow /search/
Disallow /_wb$
Disallow /includes/
Disallow *.json$

Other Records

Field Value
crawl-delay 1
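
The scan lists five groups (`*`, `blp_bbot/0.1`, `semrushbot`, `mediapartners-google`, `bingbot`), so a crawler first has to decide which group applies to it. Below is a rough sketch of that selection step, assuming a case-insensitive substring match against the crawler's User-Agent string with a fallback to `*`. Real parsers differ in how they match names, so treat the matching rule, the `GROUPS` summary labels, and the `select_group` helper as illustrative assumptions rather than the site's or any crawler's actual behaviour.

```python
# Group names as listed in the scan; the values are informal summaries
# of each group's rules, added here only for illustration.
GROUPS = {
    "*": "restricted paths, crawl-delay 1",
    "blp_bbot/0.1": "Disallow /",
    "semrushbot": "Disallow /",
    "mediapartners-google": "Allow /",
    "bingbot": "restricted paths, crawl-delay 1",
}


def select_group(user_agent: str, groups=GROUPS) -> str:
    """Pick the group name that applies to a User-Agent string.

    Assumption: a group applies when its name occurs in the lowercased
    User-Agent string. The longest matching name wins; "*" is the
    fallback when no named group matches.
    """
    ua = user_agent.lower()
    matches = [name for name in groups if name != "*" and name in ua]
    if matches:
        return max(matches, key=len)
    return "*"
```

For example, a User-Agent containing `bingbot/2.0` selects the `bingbot` group, while an unlisted client such as `curl/8.0` falls back to `*`.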

Other Records

Field Value
sitemap https://allafrica.com/misc/sitemap/categories.xml
sitemap https://allafrica.com/misc/sitemap/aans-urls-en.xml