ancert.com
robots.txt

Robots Exclusion Standard data for ancert.com

Resource Scan

Scan Details

Site Domain ancert.com
Base Domain ancert.com
Scan Status Ok
Last Scan2024-10-25T08:35:39+00:00
Next Scan 2024-11-24T08:35:39+00:00

Last Scan

Scanned2024-10-25T08:35:39+00:00
URL https://www.ancert.com/robots.txt
Redirect https://www.ctnotariado.com/robots.txt
Redirect Domain www.ctnotariado.com
Redirect Base ctnotariado.com
Domain IPs 193.16.43.154
Redirect IPs 193.16.43.154
Response IP 193.16.43.154
Found Yes
Hash d19ceba8feceb5d7b1f3254c8003c82ba5d0624169295ac5dadd1ec2156c45a1
SimHash 87045ce2c313

Groups

baiduspider

Rule Path
Disallow /

baiduspider-cpro

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-mobile

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

yandex

Rule Path
Disallow /

stackrambler

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

asterias

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow
Disallow /es
Disallow /css
Disallow /js

Other Records

Field Value
crawl-delay 5