connectingafrica.com
robots.txt

Robots Exclusion Standard data for connectingafrica.com

Resource Scan

Scan Details

Site Domain connectingafrica.com
Base Domain connectingafrica.com
Scan Status Ok
Last Scan 2024-05-01T11:48:00+00:00
Next Scan 2024-05-31T11:48:00+00:00
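
The two timestamps above imply the rescan interval. A minimal sketch of that arithmetic, assuming the values are ISO 8601 timestamps exactly as listed (the variable names are illustrative, not part of the scan data):

```python
from datetime import datetime

# Timestamps copied from the Scan Details listing.
last_scan = datetime.fromisoformat("2024-05-01T11:48:00+00:00")
next_scan = datetime.fromisoformat("2024-05-31T11:48:00+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(interval.days)
```

For these values the difference is a whole number of days, with both scans at the same time of day.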

Last Scan

Scanned 2024-05-01T11:48:00+00:00
URL https://www.connectingafrica.com/robots.txt
Domain IPs 104.18.34.238, 172.64.153.18, 2606:4700:4400::6812:22ee, 2606:4700:4400::ac40:9912
Response IP 172.64.153.18
Found Yes
Hash 885e2fe249ddfe7a8540b3773a3fe5023c37305e73e77ab6b481f1446741447d
SimHash ee74dcc64002
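
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The sketch below assumes, without confirmation from the scan data, that the digest is taken over the raw bytes of the robots.txt response body. The helper names `sha256_hex` and `matches_recorded_hash` are hypothetical and only illustrate how a later fetch could be compared against the recorded value.

```python
import hashlib

# Hash value copied from the Last Scan listing.
RECORDED_HASH = "885e2fe249ddfe7a8540b3773a3fe5023c37305e73e77ab6b481f1446741447d"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Assumption: the scanner hashes the raw response body with SHA-256."""
    return sha256_hex(body) == recorded.lower()
```

If the scanner normalizes the file before hashing (line endings, trailing whitespace), a byte-for-byte digest of a fresh download will not match even when the rules are unchanged.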

Groups

*

Rule Path
Disallow /ad_view.asp
Disallow /ad_redirect.asp
Disallow /ad_build.asp
Disallow /login.asp
Disallow /search.asp
Disallow /email.asp
Disallow /register.asp
Disallow /lg_redirect.asp
Disallow /complink_redirect.asp
Disallow /tv/
Disallow /ng_asset.asp
Disallow /ng_checkforthumbnail.asp
Disallow /ng_loginsuccess.asp
Disallow /ng_logout.asp
Disallow /ng_newsletterprefs.asp
Disallow /ng_profile.asp
Disallow /ng_register.asp
Disallow /ng_thumbnail.asp
Disallow /ng_verifytoken.asp
Disallow /ngassetform_xml.asp
Disallow /ngnewsletterform_xml.asp
Disallow /ngprofileform_xml.asp
Disallow /ngregform_xml.asp
Disallow /lgservice/

applebot

Rule Path
Disallow /messages.asp

crazywebcrawler

Rule Path
Disallow /

domain re-animator

Rule Path
Disallow /

linkscrawler

Rule Path
Disallow /

link sleuth

Rule Path
Disallow /

linkspammer

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

rankvalbot

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /
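
The groups above can be checked programmatically. The following is a sketch using Python's standard `urllib.robotparser`, with a partial robots.txt reconstructed from the listing. It is not the verbatim file: only a few rules are included, and the original casing of the user-agent names is unknown. The `allowed` helper and the `genericbot` agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the Groups listing; not the verbatim file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /login.asp
Disallow: /search.asp
Disallow: /tv/
Disallow: /lgservice/

User-agent: applebot
Disallow: /messages.asp

User-agent: linkwalker
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.connectingafrica.com"


def allowed(agent: str, path: str) -> bool:
    """Return True if the parsed rules let `agent` fetch `path`."""
    return parser.can_fetch(agent, BASE + path)


print(allowed("genericbot", "/tv/episode"))    # covered by the * group
print(allowed("applebot", "/messages.asp"))    # covered by the applebot group
print(allowed("linkwalker", "/"))              # blocked from the whole site
```

A crawler that matches a named group is evaluated against that group only, so in this sketch `applebot` is not subject to the `*` rules such as `/login.asp`.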

Other Records

Field Value
sitemap https://www.connectingafrica.com/sitemap.xml
sitemap https://www.connectingafrica.com/news-sitemap.asp
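
The sitemap records can be read with the same standard-library parser. A minimal sketch, assuming the records appear in the file as `Sitemap:` lines (the listing shows only the lowercased field name) and Python 3.8 or later for `site_maps()`:

```python
from urllib.robotparser import RobotFileParser

# Sitemap records rebuilt from the Other Records listing.
SITEMAP_LINES = [
    "Sitemap: https://www.connectingafrica.com/sitemap.xml",
    "Sitemap: https://www.connectingafrica.com/news-sitemap.asp",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records are present.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```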