headcycle.com
robots.txt

Robots Exclusion Standard data for headcycle.com

Resource Scan

Scan Details

Site Domain	headcycle.com
Base Domain	headcycle.com
Scan Status	Ok
Last Scan	2025-10-14T21:31:33+00:00
Next Scan	2025-11-13T21:31:33+00:00

Last Scan

Scanned	2025-10-14T21:31:33+00:00
URL	https://headcycle.com/robots.txt
Redirect	https://www.headcycle.com/robots.txt
Redirect Domain	www.headcycle.com
Redirect Base	headcycle.com
Domain IPs	13.248.132.87, 35.71.145.101, 75.2.97.79, 99.83.151.71
Redirect IPs	13.248.132.87, 35.71.145.101, 75.2.97.79, 99.83.151.71
Response IP	35.71.145.101
Found	Yes
Hash	09fe05087499cbd23a8fe30319c3d237b85c664b8d96eabc766629b1349a3eea
SimHash	525fd148fa51

Groups

mj12bot

Rule	Path
Disallow	/

yandexbot

Rule	Path
Disallow	/

yandex

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

spbot

Rule	Path
Disallow	/

blexbot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

mauibot

Rule	Path
Disallow	/

baiduspider

Rule	Path
Disallow	/

barkrowler

Rule	Path
Disallow	/

barkrowler

Rule	Path
Disallow	/

barkrowler/0.9

Rule	Path
Disallow	/

petalbot

Rule	Path
Disallow	/

serpstatbot

Rule	Path
Disallow	/

serpstatbot/2.1

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

velenpublicwebcrawler

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

femtosearchbot/1.0

Rule	Path
Disallow	/

dataforseobot

Rule	Path
Disallow	/

dataforseobot/1.0

Rule	Path
Disallow	/

awariosmartbot

Rule	Path
Disallow	/

awariorssbot

Rule	Path
Disallow	/

amazonbot/0.1

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

my-tiny-bot

Rule	Path
Disallow	/

claude-web

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

imagesiftbot

Rule	Path
Disallow	/

seekportbot

Rule	Path
Disallow	/

friendlycrawler/1.0

Rule	Path
Disallow	/

friendlycrawler

Rule	Path
Disallow	/

facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)

Rule	Path
Disallow	/

facebookexternalhit/1.1

Rule	Path
Disallow	/

facebookcatalog/1.0

Rule	Path
Disallow	/

user-agent: velenpublicwebcrawler

Rule	Path
Disallow	/

bdcbot

Rule	Path
Disallow	/

bdcbot/1.0

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

dataforseobot/1.0

Rule	Path
Disallow	/

amazonbot/0.1

Rule	Path
Disallow	/

dataforseobot

Rule	Path
Disallow	/

femtosearchbot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

claude-searchbot/1.0

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/login
Disallow	/sign_up
Disallow	/wordcloud
Disallow	/wordcloud2
Disallow	/h/*/new
Disallow	/h/*/wordcloud
Disallow	/domain/*/new
Disallow	/domain/*/wordcloud
Disallow	/user/*/new
Disallow	/user/*/comments
Disallow	/user/*/submitted
Disallow	/user/*/wordcloud
Disallow	/comment_share/*
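
The groups above only matter once a crawler decides which one applies to it. The sketch below illustrates one way that selection could work, assuming the common convention (described in RFC 9309) that a crawler uses the group whose product token matches its own, case-insensitively, and otherwise falls back to `*`. The names `GROUPS`, `product_token`, and `select_group` are hypothetical, `GROUPS` is a hand-picked subset of the names listed in this scan, and trimming version suffixes such as `/0.9` is an assumption about parser behaviour, not something the scan confirms.

```python
import re

# Hand-picked subset of the group names from the scan (illustrative only).
GROUPS = [
    "mj12bot",
    "barkrowler/0.9",
    "gptbot",
    "claudebot",
    "facebookexternalhit/1.1",
    "*",
]


def product_token(name: str) -> str:
    """Keep the leading run of letters, '_' and '-' (assumed normalization)."""
    match = re.match(r"[a-z_-]+", name.lower())
    return match.group(0) if match else ""


def select_group(crawler_token: str, groups=GROUPS) -> str:
    """Return the first named group matching the crawler, else the '*' group."""
    wanted = product_token(crawler_token)
    for name in groups:
        if name != "*" and product_token(name) == wanted:
            return name
    return "*"


if __name__ == "__main__":
    for crawler in ("GPTBot", "Barkrowler", "Googlebot"):
        print(crawler, "->", select_group(crawler))
```

Under these assumptions, every named group in this scan carries `Disallow: /`, so a crawler that lands in one of them is blocked from the whole site, while any other crawler is governed by the path rules in the `*` group.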

Comments

Disallow: /rss
Disallow: /search
Disallow: /h/*/search
Disallow: /h/*/rss
Disallow: /domain/*/search
Disallow: /user/*/search
this works, but the /search rule above actually works first
Disallow: /*?q=
no idea if this does the same thing or is more accurate
Disallow: /*?q=*
Disallow: /*?after=
Disallow: /*?before=
Disallow all query strings - might not be the best solution
Disallow: /*?*
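
The comments above ask whether `/*?q=` and `/*?q=*` behave the same. Under the widely used matching convention (RFC 9309 and the major search engines), a rule matches any path that starts with the pattern and `*` matches any run of characters, so a trailing `*` is redundant. The sketch below illustrates that with a small matcher; `pattern_to_regex`, `is_blocked`, and `SAMPLE_PATHS` are hypothetical names and sample values, not part of the scanned file, and individual crawlers may implement matching differently.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the start."""
    anchored = pattern.endswith("$")  # a trailing '$' pins the end of the path
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece; every '*' becomes '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(pattern: str, path: str) -> bool:
    """True if the path (including any query string) matches the pattern."""
    return pattern_to_regex(pattern).match(path) is not None


# Made-up paths for illustration.
SAMPLE_PATHS = [
    "/search?q=cats",
    "/h/news?q=",
    "/h/news?after=abc",
    "/h/news",
    "/rss",
]

if __name__ == "__main__":
    for path in SAMPLE_PATHS:
        print(
            path,
            is_blocked("/*?q=", path),
            is_blocked("/*?q=*", path),
            is_blocked("/*?*", path),
        )
```

With this matcher the two `?q=` patterns agree on every sample path, and `/*?*` blocks any path containing a query string, which would make the `?q=`, `?after=`, and `?before=` rules unnecessary if it were enabled.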
