linux.do
robots.txt

Robots Exclusion Standard data for linux.do

Resource Scan

Scan Details

Site Domain	linux.do
Base Domain	linux.do
Scan Status	Ok
Last Scan	2024-05-12T00:49:22+00:00
Next Scan	2024-06-11T00:49:22+00:00

Last Scan

Scanned	2024-05-12T00:49:22+00:00
URL	https://linux.do/robots.txt
Domain IPs	104.26.12.174, 104.26.13.174, 172.67.74.154, 2606:4700:20::681a:cae, 2606:4700:20::681a:dae, 2606:4700:20::ac43:4a9a
Response IP	104.26.13.174
Found	Yes
Hash	0fd8ca4900b55d36f52ae401a92a6e5f7681ac2512492992743da301782a6d47
SimHash	299d1dc577d0

Groups

mauibot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

blexbot

Rule	Path
Disallow	/

seo spider

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/invites/
Disallow	/admin/
Disallow	/auth/
Disallow	/assets/browser-update*.js
Disallow	/email/
Disallow	/session
Disallow	/user-api-key
Disallow	/*?api_key*
Disallow	/*?*api_key*
Disallow	/badges
Disallow	/u/
Disallow	/my
Disallow	/search
Disallow	/tag/*/l
Disallow	/g
Disallow	/t/*/*.rss
Disallow	/c/*.rss

googlebot

Rule	Path
Disallow	/invites/
Disallow	/admin/
Disallow	/auth/
Disallow	/assets/browser-update*.js
Disallow	/email/
Disallow	/session
Disallow	/user-api-key
Disallow	/*?api_key*
Disallow	/*?*api_key*

Other Records

Field	Value
sitemap	https://linux.do/sitemap.xml

Comments

See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
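
As a rough illustration of how the groups in this scan apply to different crawlers, the sketch below feeds a hand-copied subset of the rules to Python's standard urllib.robotparser. It is not the full file: the wildcard rules are left out because that module matches paths by simple prefix only, and the helper name allowed, the agent name SomeOtherBot, and the path /latest are assumptions made for the example.

```python
from urllib.robotparser import RobotFileParser

# Hand-copied subset of the groups in this scan (prefix rules only).
ROBOTS_LINES = """\
User-agent: mauibot
Disallow: /

User-agent: googlebot
Disallow: /admin/
Disallow: /session

User-agent: *
Disallow: /admin/
Disallow: /u/
Disallow: /search
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on linux.do?"""
    return parser.can_fetch(agent, "https://linux.do" + path)


if __name__ == "__main__":
    for agent, path in [
        ("MauiBot", "/"),
        ("Googlebot", "/u/someone"),
        ("SomeOtherBot", "/u/someone"),
        ("SomeOtherBot", "/latest"),
    ]:
        print(agent, path, allowed(agent, path))
```

A crawler with its own group (here googlebot) is checked against that group only, so paths such as /u/ that appear only under * are not blocked for it.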
