ccf.georgetown.edu
robots.txt

Robots Exclusion Standard data for ccf.georgetown.edu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ccf.georgetown.edu
Base Domain	georgetown.edu
Scan Status	Ok
Last Scan	2025-02-18T00:04:02+00:00
Next Scan	2025-03-20T00:04:02+00:00

Last Scan

Scanned	2025-02-18T00:04:02+00:00
URL	https://ccf.georgetown.edu/robots.txt
Domain IPs	23.185.0.2, 2620:12a:8000::2, 2620:12a:8001::2
Response IP	23.185.0.2
Found	Yes
Hash	b404108f7e8208e0e06678a62f9bba163000c83dcc06fb0353ef672d191d90e0
SimHash	784550106b95

Groups

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-includes/

Rule

Path

Disallow

/wp-admin/

Disallow

/wp-includes/

*

Rule	Path
Allow	/

Rule

Path

Allow

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

yahoo pipes 2.0

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

baiduspider+

Rule	Path
Allow	/

Rule

Path

Allow

mozilla/2.0

Rule	Path
Allow	/

Rule

Path

Allow

charlotte

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

converacrawler/0.9e

Rule	Path
Disallow	/

Rule

Path

Disallow

gigabot

Rule	Path
Disallow	/

Rule

Path

Disallow

vse/1.0

Rule	Path
Allow	/

Rule

Path

Allow

gsa-crawler

Rule	Path
Allow	/

Rule

Path

Allow

Comments

STANDARD

ccf.georgetown.edurobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

googlebot

bingbot

yahoo pipes 2.0

googlebot-mobile

baiduspider+

mozilla/2.0

charlotte

mj12bot

converacrawler/0.9e

gigabot

vse/1.0

gsa-crawler

Comments

ccf.georgetown.edu
robots.txt