capecod.edu
robots.txt

Robots Exclusion Standard data for capecod.edu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	capecod.edu
Base Domain	capecod.edu
Scan Status	Ok
Last Scan	2024-09-08T20:13:05+00:00
Next Scan	2024-10-08T20:13:05+00:00

Last Scan

Scanned	2024-09-08T20:13:05+00:00
URL	https://capecod.edu/robots.txt
Domain IPs	174.129.93.217, 34.228.91.126
Response IP	174.129.93.217
Found	Yes
Hash	9b46457a2559ef58ba5bcdce7707e763b8cb711e35b95da62ef0f676da55c2cf
SimHash	6479de164347

Groups

googlebot

Rule	Path
Allow	/
Disallow	/amt-gened/
Disallow	/catalog-archive/
Disallow	/catalog-update/

Rule

Path

Allow

Disallow

/amt-gened/

Disallow

/catalog-archive/

Disallow

/catalog-update/

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

googlebot-image

Rule	Path
Allow	/
Disallow	/amt-gened/
Disallow	/catalog-archive/
Disallow	/catalog-update/

Rule

Path

Allow

Disallow

/amt-gened/

Disallow

/catalog-archive/

Disallow

/catalog-update/

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

duckduckbot

Rule	Path
Allow	/
Disallow	/amt-gened/
Disallow	/catalog-archive/
Disallow	/catalog-update/

Rule

Path

Allow

Disallow

/amt-gened/

Disallow

/catalog-archive/

Disallow

/catalog-update/

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

bingbot

Rule	Path
Allow	/
Disallow	/amt-gened/
Disallow	/catalog-archive/
Disallow	/catalog-update/

Rule

Path

Allow

Disallow

/amt-gened/

Disallow

/catalog-archive/

Disallow

/catalog-update/

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

msnbot

Rule	Path
Allow	/
Disallow	/amt-gened/
Disallow	/catalog-archive/
Disallow	/catalog-update/

Rule

Path

Allow

Disallow

/amt-gened/

Disallow

/catalog-archive/

Disallow

/catalog-update/

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

funnelback

Rule	Path
Allow	/
Disallow	/amt-gened/
Disallow	/catalog-archive/
Disallow	/catalog-update/

Rule

Path

Allow

Disallow

/amt-gened/

Disallow

/catalog-archive/

Disallow

/catalog-update/

terminalfour nutch spider

Rule	Path
Allow	/
Disallow	/amt-gened/
Disallow	/catalog-archive/
Disallow	/catalog-archives/
Disallow	/catalog-update/

Rule

Path

Allow

Disallow

/amt-gened/

Disallow

/catalog-archive/

Disallow

/catalog-archives/

Disallow

/catalog-update/

*

Rule	Path
Disallow	/

Rule

Path

Disallow

capecod.edurobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googlebot

Other Records

googlebot-image

Other Records

duckduckbot

Other Records

bingbot

Other Records

msnbot

Other Records

funnelback

terminalfour nutch spider

*

capecod.edu
robots.txt