cleyn-university.com
robots.txt

Robots Exclusion Standard data for cleyn-university.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cleyn-university.com
Base Domain	cleyn-university.com
Scan Status	Ok
Last Scan	2025-10-27T13:24:36+00:00
Next Scan	2025-11-03T13:24:36+00:00

Last Scan

Scanned	2025-10-27T13:24:36+00:00
URL	https://cleyn-university.com/robots.txt
Domain IPs	104.161.46.138
Response IP	104.161.46.138
Found	Yes
Hash	300f7ec88f23673dd8044a6ba8af8f9464b44468fe37bbf8c550af2755ff64a8
SimHash	5a161d9acc96

Groups

*

Rule	Path
Disallow	/index.php?act=calendar
Disallow	/index.php?act=daffiliates
Disallow	/index.php?act=Post
Disallow	/index.php?act=Mobile&CODE=Post
Disallow	/index.php?act=Forward
Disallow	/index.php?act=Track
Disallow	/index.php?act=Print
Disallow	/index.php?act=Msg
Disallow	/index.php?act=Search
Disallow	/index.php?act=modcp
Disallow	/index.php?act=UserCP
Disallow	/admin.php

Rule

Path

Disallow

/index.php?act=calendar

Disallow

/index.php?act=daffiliates

Disallow

/index.php?act=Post

Disallow

/index.php?act=Mobile&CODE=Post

Disallow

/index.php?act=Forward

Disallow

/index.php?act=Track

Disallow

/index.php?act=Print

Disallow

/index.php?act=Msg

Disallow

/index.php?act=Search

Disallow

/index.php?act=modcp

Disallow

/index.php?act=UserCP

Disallow

/admin.php

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

spiderling

Rule	Path
Disallow	/

Rule

Path

Disallow

slurp

Rule	Path
Disallow	/

Rule

Path

Disallow

spbot

Rule	Path
Disallow	/

Rule

Path

Disallow

exabot

Rule	Path
Disallow	/

Rule

Path

Disallow

neevabot

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seekportbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seekport crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

crawler4j

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

a6-indexer

Rule	Path
Disallow	/

Rule

Path

Disallow

alphaseobot

Rule	Path
Disallow	/

Rule

Path

Disallow

alphaseobot-sa

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou web spider

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou inst spider

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot-sa

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

the knowledge ai

Rule	Path
Disallow	/

Rule

Path

Disallow

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ltx71 - (http://ltx71.com/)

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

awariorssbot

Rule	Path
Disallow	/

Rule

Path

Disallow

awariosmartbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seokicks

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

adsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

domaincrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

checkmarknetwork

Rule	Path
Disallow	/

Rule

Path

Disallow

dataforseobot

Rule	Path
Disallow	/

Rule

Path

Disallow

istellabot

Rule	Path
Disallow	/

Rule

Path

Disallow

pimeyes.com crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

oai-searchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

Insanely, overwhelmingly aggressive AI scrapers
Non-NGINX cached robots.txt (1/2/2025)

cleyn-university.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

spiderling

slurp

spbot

exabot

neevabot

petalbot

seekportbot

seekport crawler

blexbot/1.0

bytespider

crawler4j

dotbot

a6-indexer

alphaseobot

alphaseobot-sa

semrushbot

bytespider

sogou web spider

sogou inst spider

semrushbot-sa

seznambot

the knowledge ai

turnitinbot

ltx71 - (http://ltx71.com/)

blexbot

awariorssbot

awariosmartbot

seokicks

ccbot

piplbot

adsbot

domaincrawler

checkmarknetwork

dataforseobot

istellabot

pimeyes.com crawler

ccbot

gptbot

claudebot

oai-searchbot

Comments

cleyn-university.com
robots.txt