sheriahub.com
robots.txt

Robots Exclusion Standard data for sheriahub.com

Resource Scan

Scan Details

Site Domain	sheriahub.com
Base Domain	sheriahub.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-09-04T13:06:21+00:00
Next Scan	2024-11-03T13:06:21+00:00
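
The timestamps in this report are ISO 8601 values with an explicit UTC offset, so the scan intervals can be checked with ordinary date arithmetic. The sketch below is illustrative only: the variable names are made up here, and the values are copied from the Last Scan and Next Scan rows above and from the Scanned row under Last Successful Scan.

```python
from datetime import datetime

# Timestamps copied from the report; variable names are illustrative.
last_scan = datetime.fromisoformat("2024-09-04T13:06:21+00:00")
next_scan = datetime.fromisoformat("2024-11-03T13:06:21+00:00")
last_success = datetime.fromisoformat("2024-06-14T13:04:55+00:00")

# Interval between the failed scan and the next scheduled attempt.
rescan_interval = next_scan - last_scan

# Time elapsed between the last successful scan and the failed one.
since_success = last_scan - last_success

print(rescan_interval.days)  # 60
print(since_success.days)    # 82
```

Under these values the next attempt is scheduled exactly 60 days after the failed scan, and the rules listed below were 82 days old at the time of that failure.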

Last Successful Scan

Scanned	2024-06-14T13:04:55+00:00
URL	https://sheriahub.com/robots.txt
Domain IPs	104.21.33.215, 172.67.192.214, 2606:4700:3030::ac43:c0d6, 2606:4700:3037::6815:21d7
Response IP	172.67.192.214
Found	Yes
Hash	dc22809b1bb471c924f18fdc2295f4855a792713267de7b7933c13c04dc63545
SimHash	580577c1c795
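
The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response body, which is a guess and not something the report states; the function names are hypothetical. It shows how a locally saved copy of the file could be compared against the reported value.

```python
import hashlib

# Value copied from the Hash row above.
REPORTED_HASH = "dc22809b1bb471c924f18fdc2295f4855a792713267de7b7933c13c04dc63545"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes) -> bool:
    # ASSUMPTION: the report hashes the raw, unmodified response bytes
    # with SHA-256. If it normalizes the text first or uses another
    # algorithm, this comparison will simply return False.
    return sha256_hex(body) == REPORTED_HASH
```

A mismatch does not prove the file changed; it may only mean the assumption about the algorithm or the hashed bytes is wrong.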

Groups

*

Rule	Path
Disallow	/dashboard/*

*

Rule	Path
Disallow	/js/improvement.js*

*

Rule	Path
Disallow	/cgi-bin/*

ia_archiver

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

criteobot/0.1

Rule	Path
Disallow

applebot

Rule	Path
Allow	/

baiduspider

Rule	Path
Allow	/

bingbot

Rule	Path
Allow	/

discordbot

Rule	Path
Allow	/

googlebot

Rule	Path
Allow	/

mediapartners-google

Rule	Path
Allow	/

linkedinbot

Rule	Path
Allow	/

msnbot

Rule	Path
Allow	/

naverbot

Rule	Path
Allow	/

slurp

Rule	Path
Allow	/

telegrambot

Rule	Path
Allow	/

twitterbot

Rule	Path
Allow	/

yandex

Rule	Path
Allow	/

yeti

Rule	Path
Allow	/

gptbot

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/
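
To make the combined effect of the groups above concrete, here is a minimal sketch of how a crawler might evaluate them. It assumes RFC 9309 semantics: a crawler uses the group naming its own user agent if one exists and otherwise falls back to `*`, groups with the same user agent are merged, `*` in a path matches any run of characters, and the longest matching path wins with Allow preferred on a tie. The `RULES` list is a hand transcription of a subset of the tables, and `is_allowed` and `pattern_to_regex` are hypothetical helper names. Real crawlers may interpret the file differently.

```python
import re

# Hand-transcribed subset of the Groups tables: (user agent, rule, path).
RULES = [
    ("*", "disallow", "/dashboard/*"),
    ("*", "disallow", "/js/improvement.js*"),
    ("*", "disallow", "/cgi-bin/*"),
    ("ia_archiver", "disallow", "/"),
    ("semrushbot", "disallow", "/"),
    ("criteobot/0.1", "disallow", ""),  # empty path: nothing is disallowed
    ("googlebot", "allow", "/"),
    ("bingbot", "allow", "/"),
    ("gptbot", "disallow", "/"),
    ("*", "disallow", "/"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = path.endswith("$")  # a trailing '$' pins the end of the URL
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent, url_path):
    """Return True if agent may fetch url_path under the assumed semantics."""
    agent = agent.lower()
    # Merge every group for this agent; fall back to '*' if there is none.
    group = [r for r in RULES if r[0] == agent]
    if not group:
        group = [r for r in RULES if r[0] == "*"]
    best_len, allowed = -1, True  # no matching rule means allowed
    for _, rule, path in group:
        if not path:
            continue  # an empty path matches no URL
        if pattern_to_regex(path).match(url_path):
            is_allow = rule == "allow"
            # Longest pattern wins; Allow wins when lengths are equal.
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len, allowed = len(path), is_allow
    return allowed


print(is_allowed("googlebot", "/dashboard/settings"))  # True
print(is_allowed("gptbot", "/"))                       # False
print(is_allowed("somebot", "/about"))                 # False
```

Under these assumptions the final `*` group with `Disallow /` blocks every crawler that is not named explicitly, which makes the three narrower `*` rules redundant. Named crawlers such as googlebot follow only their own group, so the `/dashboard/*` rule does not apply to them.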

Other Records

Field	Value
sitemap	https://sheriahub.com/sitemap.xml

Comments

Notice: Collection of data on Sheriahub through automated means is
prohibited unless you have express written permission from Sheriahub
and may only be conducted for the limited purpose contained in said
permission.
