capetown-webcam.com
robots.txt

Robots Exclusion Standard data for capetown-webcam.com

Resource Scan

Scan Details

Site Domain capetown-webcam.com
Base Domain capetown-webcam.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-11T05:40:28+00:00
Next Scan 2025-01-09T05:40:28+00:00

Last Successful Scan

Scanned2024-03-16T05:38:20+00:00
URL https://capetown-webcam.com/robots.txt
Redirect https://www.capetown-webcam.com/robots.txt
Redirect Domain www.capetown-webcam.com
Redirect Base capetown-webcam.com
Domain IPs 217.160.0.73
Redirect IPs 217.160.0.73
Response IP 217.160.0.73
Found Yes
Hash 509c55e876a86092673ffa2c28401ce575b1b26b484a1860e507ece46b4706f3
SimHash fb1dc6b15910

Groups

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

seoscanners

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

python-requests

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baidu

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

*

Rule Path
Allow /*.js*
Allow /*.css*
Allow /*.png*
Allow /*.jpg*
Allow /*.gif*
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
Disallow /*?*view=category
Disallow /*?*view=article
Disallow /?jsn_setmobile=no
Disallow /*?rCH=2
Disallow /*?rCH=-2

Other Records

Field Value
sitemap https://www.capetown-webcam.com/sitemap.xml