controlbooth.com
robots.txt

Robots Exclusion Standard data for controlbooth.com

Resource Scan

Scan Details

Site Domain controlbooth.com
Base Domain controlbooth.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-05-12T00:20:11+00:00
Next Scan 2025-06-11T00:20:11+00:00

Last Successful Scan

Scanned2025-03-21T22:12:57+00:00
URL https://controlbooth.com/robots.txt
Redirect https://www.controlbooth.com/robots.txt
Redirect Domain www.controlbooth.com
Redirect Base controlbooth.com
Domain IPs 104.26.10.118, 104.26.11.118, 172.67.71.27, 2606:4700:20::681a:a76, 2606:4700:20::681a:b76, 2606:4700:20::ac43:471b
Redirect IPs 104.26.10.118, 104.26.11.118, 172.67.71.27, 2606:4700:20::681a:a76, 2606:4700:20::681a:b76, 2606:4700:20::ac43:471b
Response IP 172.67.71.27
Found Yes
Hash acd44909465a5871199f0526c97292b1c73dbd2c72c929507e77f1704a57bde3
SimHash a41d49d62216

Groups

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

yandex

Rule Path
Disallow /

*

Rule Path
Disallow /account*
Disallow /help*
Disallow /misc/quick-navigation-menu*
Disallow /login*
Disallow /logout*
Disallow /lost-password*
Disallow /register*
Disallow /reports*
Disallow /search*
Disallow /conversations*
Disallow /cron.php
Disallow /admin.php
Disallow /online/*
Disallow /recent-activity/*
Disallow /cdn-cgi/
Disallow /cdn-cgi/*

Other Records

Field Value
sitemap http://www.controlbooth.com/sitemap.php