cureus.com
robots.txt

Robots Exclusion Standard data for cureus.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cureus.com
Base Domain	cureus.com
Scan Status	Ok
Last Scan	2024-06-03T07:15:23+00:00
Next Scan	2024-06-10T07:15:23+00:00

Last Scan

Scanned	2024-06-03T07:15:23+00:00
URL	https://cureus.com/robots.txt
Redirect	https://www.cureus.com/robots.txt
Redirect Domain	www.cureus.com
Redirect Base	cureus.com
Domain IPs	104.22.4.111, 104.22.5.111, 172.67.8.5, 2606:4700:10::6816:46f, 2606:4700:10::6816:56f, 2606:4700:10::ac43:805
Redirect IPs	104.22.4.111, 104.22.5.111, 172.67.8.5, 2606:4700:10::6816:46f, 2606:4700:10::6816:56f, 2606:4700:10::ac43:805
Response IP	104.22.4.111
Found	Yes
Hash	e2ab7c3159e4075123dfd7d825f880702dbf7c85d62baba6a694db7c8b0fb433
SimHash	a60d0dad64d0

Groups

*

Rule	Path
Disallow	/search
Disallow	/users/*/similar_profiles
Disallow	/invite_colleague
Disallow	/people
Disallow	/cureus_career_center

Rule

Path

Disallow

/search

Disallow

/users/*/similar_profiles

Disallow

/invite_colleague

Disallow

/people

Disallow

/cureus_career_center

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	8

Field

Value

crawl-delay

8

semrushbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	8

Field

Value

crawl-delay

8

dotbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	8

Field

Value

crawl-delay

8

Back to top

Other Records

Field	Value
sitemap	https://cureus-production.s3.amazonaws.com/sitemaps/sitemap.xml

Field

Value

sitemap

https://cureus-production.s3.amazonaws.com/sitemaps/sitemap.xml

Back to top

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:

Back to top

cureus.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

ahrefsbot

Other Records

semrushbot

Other Records

dotbot

Other Records

Other Records

Comments

cureus.com
robots.txt