ccava.net
robots.txt

Robots Exclusion Standard data for ccava.net

Resource Scan

Scan Details

Site Domain ccava.net
Base Domain ccava.net
Scan Status Ok
Last Scan2025-12-16T09:39:05+00:00
Next Scan 2025-12-23T09:39:05+00:00

Last Scan

Scanned2025-12-16T09:39:05+00:00
URL https://ccava.net/robots.txt
Domain IPs 240d:c010:81:2:2c00::dd, 43.174.246.27, 43.174.247.27
Response IP 43.174.247.27
Found Yes
Hash 66059d4ebdcb166e65007ea828575f9e2bf492125f2e3c66ee1f5dcf9f8f5e76
SimHash 4845515747f9

Groups

baiduspider

Rule Path
Allow /

googlebot

Rule Path
Allow /

sogou web spider

Rule Path
Allow /

yahoo! slurp

Rule Path
Allow /

bingbot

Rule Path
Allow /

yandexbot

Rule Path
Allow /

yisouspider

Rule Path
Disallow /

youdaobot

Rule Path
Allow /

jikespider

Rule Path
Allow /

dnspod

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.ccava.net/sitemap.xml
sitemap https://www.ccava.net/sitemap.txt

Warnings

  • 2 invalid lines.