cumm.co.za
robots.txt

Robots Exclusion Standard data for cumm.co.za

Resource Scan

Scan Details

Site Domain cumm.co.za
Base Domain cumm.co.za
Scan Status Ok
Last Scan2025-03-30T19:54:23+00:00
Next Scan 2025-04-29T19:54:23+00:00

Last Scan

Scanned2025-03-30T19:54:23+00:00
URL https://cumm.co.za/robots.txt
Redirect https://www.cumm.co.za//robots.txt
Redirect Domain www.cumm.co.za
Redirect Base cumm.co.za
Domain IPs 104.21.33.130, 172.67.145.49, 2606:4700:3033::ac43:9131, 2606:4700:3037::6815:2182
Redirect IPs 104.21.33.130, 172.67.145.49, 2606:4700:3033::ac43:9131, 2606:4700:3037::6815:2182
Response IP 104.21.33.130
Found Yes
Hash c4140a7fd366b1458e80dcfc4073692ead554cee68f5e413e41aa04cfc33dcaf
SimHash 931445cae41b

Groups

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow /order/

*

Rule Path
Disallow /search/

petalbot
nbot

Rule Path
Disallow /
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

yandex

Rule Path
Disallow /

proximic

Rule Path
Disallow /php/