cladglobal.com
robots.txt

Robots Exclusion Standard data for cladglobal.com

Resource Scan

Scan Details

Site Domain cladglobal.com
Base Domain cladglobal.com
Scan Status Ok
Last Scan 2024-10-27T07:11:48+00:00
Next Scan 2024-11-26T07:11:48+00:00

Last Scan

Scanned 2024-10-27T07:11:48+00:00
URL https://cladglobal.com/robots.txt
Redirect https://www.cladglobal.com/robots.txt
Redirect Domain www.cladglobal.com
Redirect Base cladglobal.com
Domain IPs 104.21.65.54, 172.67.141.21, 2606:4700:3032::ac43:8d15, 2606:4700:3036::6815:4136
Redirect IPs 104.21.65.54, 172.67.141.21, 2606:4700:3032::ac43:8d15, 2606:4700:3036::6815:4136
Response IP 104.21.65.54
Found Yes
Hash b35ab00a4c826ee9a27d9a40d710687d5bf52f4264c40693d1ff85e3ac5e280a
SimHash 521dc472e093
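
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm, so the following is only a sketch under that assumption; `content_hash` is a hypothetical helper and not part of the scanner.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    SHA-256 is an assumption based on the digest length shown in the report.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Placeholder body; the real file content is not reproduced in this report,
    # so this digest is not expected to match the Hash value listed above.
    sample = b"User-agent: ahrefsbot\nDisallow: /\n"
    print(content_hash(sample))
```

Comparing the digest from one scan with the digest from the next would be one simple way to tell whether the file changed between scans.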

Groups

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

blp_bbot/0.1

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

cyberalert

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

webcrawler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

cityreview

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

yandex

Rule Path
Disallow /

discobot

Rule Path
Disallow /

birubot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
Disallow /

gosospider

Rule Path
Disallow /

steeler

Rule Path
Disallow /

summify

Rule Path
Disallow /

accelobot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /images/dir/

*

No rules defined. All paths allowed.
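
As a rough illustration of how the groups above resolve for a crawler, the sketch below feeds a hand-written reconstruction of a few of them into Python's standard `urllib.robotparser`. The robots.txt text is an assumption rebuilt from the table, not the site's actual file, and `is_allowed` is a hypothetical helper.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from a few groups in the report; NOT the site's actual file.
ROBOTS_TXT = """\
User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: googlebot-image
Disallow: /images/dir/
"""

_parser = RobotFileParser()
_parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, url: str) -> bool:
    """Check whether the reconstructed rules let user_agent fetch url."""
    return _parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    base = "https://www.cladglobal.com"
    for agent, path in [
        ("AhrefsBot", "/"),
        ("Googlebot-Image", "/images/dir/a.png"),
        ("Googlebot-Image", "/images/logo.png"),
        ("SomeOtherBot", "/"),  # hypothetical agent with no matching group
    ]:
        print(agent, path, is_allowed(agent, base + path))
```

An agent that matches no group falls through to the default, which in this sketch means every path is allowed, in line with the `*` entry in the report.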

Other Records

Field Value
crawl-delay 10

Warnings

  • 2 invalid lines.
  • `user agent` is not a known field.
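
A warning like the one above typically arises when a field name is misspelled, for example `user agent` written with a space instead of `user-agent`. The sketch below shows one way such lines could be flagged. The set of known fields, the sample input, and the `find_unknown_fields` helper are all assumptions for illustration; the scanner's actual validation logic is not described in this report.

```python
# Assumed list of recognised field names; the scanner's real list is unknown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def find_unknown_fields(text: str) -> list[tuple[int, str]]:
    """Return (line_number, field) for lines whose field name is not recognised."""
    problems = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        field, sep, _value = line.partition(":")
        field = field.strip().lower()
        # A line with no colon, or with an unrecognised name, is reported.
        if not sep or field not in KNOWN_FIELDS:
            problems.append((number, field))
    return problems


if __name__ == "__main__":
    # Hypothetical input, not the site's actual file.
    sample = "User agent: examplebot\nDisallow: /\nCrawl-delay: 10\n"
    print(find_unknown_fields(sample))
```

A parser that rejects the misspelled line has no group to attach the records after it to, which is one plausible way a record could end up listed separately from the groups.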