discoverhumboldt.com
robots.txt

Robots Exclusion Standard data for discoverhumboldt.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	discoverhumboldt.com
Base Domain	discoverhumboldt.com
Scan Status	Ok
Last Scan	2024-05-25T02:35:22+00:00
Next Scan	2024-06-01T02:35:22+00:00

Last Scan

Scanned	2024-05-25T02:35:22+00:00
URL	https://discoverhumboldt.com/robots.txt
Domain IPs	18.65.25.102, 18.65.25.21, 18.65.25.26, 18.65.25.86
Response IP	108.157.52.65
Found	Yes
Hash	c05aafe8ced40d3c62516df7f8e833b10cf28fce9e4ac63b0418fed63a2c27b1
SimHash	2a9d0d8576c0

Groups

mediapartners-google*

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

googlebot-mobile

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

adsbot-google

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

googlebot

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

bingbot

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

semrushbot

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

ahrefsbot

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

yandexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/events/*

Rule

Path

Disallow

/events/*

Back to top

Other Records

Field	Value
sitemap	https://DiscoverHumboldt.com/rss/all

Field

Value

sitemap

https://DiscoverHumboldt.com/rss/all

Back to top

Comments

See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-agent: *
Disallow: /

Back to top

discoverhumboldt.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

mediapartners-google*

googlebot-mobile

adsbot-google

googlebot

bingbot

Other Records

semrushbot

Other Records

ahrefsbot

Other Records

yandexbot

*

Other Records

Comments

discoverhumboldt.com
robots.txt