ct.iscute.com
robots.txt

Robots Exclusion Standard data for ct.iscute.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ct.iscute.com
Base Domain	iscute.com
Scan Status	Ok
Last Scan	2025-10-19T00:48:05+00:00
Next Scan	2025-11-18T00:48:05+00:00

Last Scan

Scanned	2025-10-19T00:48:05+00:00
URL	https://ct.iscute.com/robots.txt
Domain IPs	172.97.101.77
Response IP	172.97.101.77
Found	Yes
Hash	cbc616a897c218e84b689e756f0318e4c20ff2c160a74add18c43fd733a929d2
SimHash	2e06848d4131

Groups

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

flamingo_searchengine+(+http://www.flamingosearch.com/bot)

Rule	Path
Disallow	/

Rule

Path

Disallow

whitevector crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
crawl-delay	220

Field

Value

crawl-delay

220

slurp

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

ubicrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

doc

Rule	Path
Disallow	/

Rule

Path

Disallow

zao

Rule	Path
Disallow	/

Rule

Path

Disallow

mediapartners-google

Rule	Path
Disallow

Rule

Path

Disallow

*

Rule	Path
Disallow	/cache/
Disallow	/layouts/bookmarked.php

Rule

Path

Disallow

/cache/

Disallow

/layouts/bookmarked.php

bingbot

Rule	Path
Disallow	/cache/

Rule

Path

Disallow

/cache/

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

Comments

Crawlers that are kind enough to obey, but which we'd rather not have
unless they're feeding search engines.

ct.iscute.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

baiduspider

flamingo_searchengine+(+http://www.flamingosearch.com/bot)

whitevector crawler

yandex

Other Records

slurp

Other Records

mj12bot

ubicrawler

doc

zao

mediapartners-google

*

bingbot

Other Records

Comments

ct.iscute.com
robots.txt