cricadium.com
robots.txt

Robots Exclusion Standard data for cricadium.com

Resource Scan

Scan Details

Site Domain cricadium.com
Base Domain cricadium.com
Scan Status Ok
Last Scan 2024-11-15T03:06:17+00:00
Next Scan 2024-11-22T03:06:17+00:00

Last Scan

Scanned 2024-11-15T03:06:17+00:00
URL https://cricadium.com/robots.txt
Domain IPs 104.26.2.189, 104.26.3.189, 172.67.68.145, 2606:4700:20::681a:2bd, 2606:4700:20::681a:3bd, 2606:4700:20::ac43:4491
Response IP 104.26.3.189
Found Yes
Hash 3aebfd89cab6af8c8fd93753dcb9944d012b18eaf7d9f98dde7e747210dbb087
SimHash c00a1a60ec91
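
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file, although the report does not say which algorithm the scanner uses. As a minimal sketch under that assumption, a content hash for change detection between scans could be computed as follows. The function name `content_hash` is hypothetical, and the sketch does not attempt to reproduce the SimHash value.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a newly fetched body."""
    return content_hash(body) != previous_hash
```

Comparing digests only reveals that the file changed, not how much. A similarity hash such as the SimHash field is the usual complement when near-duplicate detection is needed.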

Groups

*

Rule Path
Allow /
Disallow /*?p=*
Disallow /*%26p%3D*
Disallow /*?s=*
Disallow /*%26s%3D*
Disallow /?author=*
Disallow /*wp-comments*
Disallow /*wp-trackback*
Disallow /*wp-feed*
Disallow /*replytocom%3D*
Disallow /*?preview=*
Disallow /*%26preview%3D*
Disallow /*add-to-cart%3D*
Disallow /*add_to_wishlist%3D*
Disallow /*cart/*
Disallow /*checkout/*
Disallow /*my-account/*
Disallow /*myaccount/*
Allow /*/plugins/*
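
To illustrate how the rules in this group interact, here is a minimal sketch of wildcard matching in the style of RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `pattern_to_regex` and `is_allowed` are hypothetical, only a subset of the rules listed above is included, and the sketch compares paths literally without any percent-encoding normalization.

```python
import re

# A subset of the "*" group from the report, as (rule, path pattern) pairs.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/*?p=*"),
    ("disallow", "/*?s=*"),
    ("disallow", "/?author=*"),
    ("disallow", "/*cart/*"),
    ("disallow", "/*checkout/*"),
    ("allow", "/*/plugins/*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=STAR_RULES) -> bool:
    """Return True if the path (including any query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                best_allow = kind == "allow"
    return best_allow
```

Under these assumptions, an ordinary article path matches only `Allow /` and is crawlable, while a search URL such as `/?s=kohli` also matches the longer `/*?s=*` pattern and is blocked.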

grapeshot

Rule Path
Disallow

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
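
The per-crawler groups above take precedence over the `*` group for the crawlers they name. Note that grapeshot has an empty Disallow value, which blocks nothing, whereas semrushbot, ahrefsbot, and gptbot are blocked from the whole site. The following is a rough sketch of that group selection, assuming a case-insensitive match on the crawler's product token with a fallback to `*`. The names `GROUPS`, `select_group`, and `fully_blocked` are hypothetical, and the `*` group is abbreviated.

```python
# Groups from the report as (rule, path pattern) pairs; "*" is abbreviated.
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/*?s=*")],
    "grapeshot": [("disallow", "")],  # empty value: nothing is disallowed
    "semrushbot": [("disallow", "/")],
    "ahrefsbot": [("disallow", "/")],
    "gptbot": [("disallow", "/")],
}


def select_group(product_token: str, groups=GROUPS):
    """Pick the rule group for a crawler, falling back to the "*" group."""
    return groups.get(product_token.lower(), groups.get("*", []))


def fully_blocked(rules) -> bool:
    """True when a group disallows "/" and carries no Allow rule."""
    blocks_root = any(kind == "disallow" and path == "/" for kind, path in rules)
    has_allow = any(kind == "allow" for kind, _ in rules)
    return blocks_root and not has_allow
```

For example, `select_group("GPTBot")` returns the single `Disallow /` rule, while a crawler with no dedicated group, such as one identifying as `Googlebot`, falls through to the `*` group.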

Other Records

Field Value
sitemap https://www.cricadium.com/sitemap_index.xml
sitemap https://www.cricadium.com/news-sitemap.xml