crunchboy.com
robots.txt

Robots Exclusion Standard data for crunchboy.com

Resource Scan

Scan Details

Site Domain crunchboy.com
Base Domain crunchboy.com
Scan Status Ok
Last Scan2025-11-07T16:25:32+00:00
Next Scan 2025-12-07T16:25:32+00:00

Last Scan

Scanned2025-11-07T16:25:32+00:00
URL https://crunchboy.com/robots.txt
Redirect https://www.crunchboy.com/robots.txt
Redirect Domain www.crunchboy.com
Redirect Base crunchboy.com
Domain IPs 104.21.5.193, 172.67.154.174, 2606:4700:3034::6815:5c1, 2606:4700:3036::ac43:9aae
Redirect IPs 104.21.5.193, 172.67.154.174, 2606:4700:3034::6815:5c1, 2606:4700:3036::ac43:9aae
Response IP 172.67.154.174
Found Yes
Hash 312cf2f3bfab097f69f20683728ee168ed4cacf714da0ebcb493f209710efa5b
SimHash 4949d2aac590

Groups

*

Rule Path
Allow /
Disallow *buy_download%3D*
Disallow *buy_stream%3D*
Disallow *logout%3D1*
Disallow *sessid%3D*
Disallow *redirect%3D*
Disallow *others%3D1*
Disallow *video_id%3D*
Disallow /*/videos/regarder/*
Disallow /*/videos/my-history
Disallow /*/videos/my-unseen
Disallow /*/videos/i-will-like
Disallow /*/dvd/my-history
Disallow /*/dvd/my-unseen
Disallow /*/dvd/i-will-like
Disallow /en/histoires*
Disallow /es/histoires*
Disallow /it/histoires*
Disallow /de/histoires*
Disallow /fr/histoires*

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.crunchboy.com/sitemap.xml