tutormate.org.uk
robots.txt

Robots Exclusion Standard data for tutormate.org.uk

Resource Scan

Scan Details

Site Domain	tutormate.org.uk
Base Domain	tutormate.org.uk
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't establish SSL connection.
Last Scan	2024-10-07T08:30:39+00:00
Next Scan	2024-12-06T08:30:39+00:00

Last Successful Scan

Scanned	2024-07-17T02:22:27+00:00
URL	https://tutormate.org.uk/robots.txt
Redirect	https://www.tutormate.org.uk/robots.txt
Redirect Domain	www.tutormate.org.uk
Redirect Base	tutormate.org.uk
Domain IPs	3.10.28.251
Redirect IPs	3.10.28.251
Response IP	3.10.28.251
Found	Yes
Hash	b3a7d37871df642721cf349899cf4573da3fba4771b23ef20c2f3b5b8cb67f42
SimHash	660ada20c1a3

Groups

yandex

Rule	Path
Disallow	/

sistrix

Rule	Path
Disallow	/

hubspot webcrawler

Rule	Path
Disallow	/

panscient.com

Rule	Path
Disallow	/

baiduspider

Rule	Path
Disallow	/

sogou spider

Rule	Path
Disallow	/

facebookexternalhit/1.1

Rule	Path
Disallow	/

semetrical

Rule	Path
Disallow	/

riddler

Rule	Path
Disallow	/

youdaobot

Rule	Path
Disallow	/

vegebot

Rule	Path
Disallow	/

searchmetricsbot

Rule	Path
Disallow	/

yahoo! slurp

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	20

vegebot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

vegi bot

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/img/uploads/users/identification

Other Records

Field	Value
sitemap	https://www.tutormate.org.uk/sitemap-index.xml

Warnings

2 invalid lines.
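
The groups listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`, assuming a hand-written reconstruction of a subset of the reported groups. The report does not reproduce the original file (which also contained 2 invalid lines), so the text in `RECONSTRUCTED_ROBOTS_TXT`, the helper `build_parser`, and the crawler name `ExampleBot` are illustrative assumptions, not the site's actual content. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a SUBSET of the groups in the scan report.
# The exact spelling, casing, and ordering of the real file are assumptions.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: yandex
Disallow: /

User-agent: mj12bot
Disallow: /

User-agent: yahoo! slurp
Crawl-delay: 20

User-agent: *
Disallow: /img/uploads/users/identification

Sitemap: https://www.tutormate.org.uk/sitemap-index.xml
"""

BASE = "https://www.tutormate.org.uk"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without performing any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(RECONSTRUCTED_ROBOTS_TXT)

# A crawler with a site-wide "Disallow: /" group may not fetch anything.
print(parser.can_fetch("yandex", BASE + "/"))            # False

# A group with only a crawl-delay record leaves all paths allowed.
print(parser.can_fetch("yahoo! slurp", BASE + "/"))      # True
print(parser.crawl_delay("yahoo! slurp"))                # 20

# Any other crawler falls through to the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/"))        # True
print(parser.can_fetch(
    "ExampleBot", BASE + "/img/uploads/users/identification/a.png"))  # False

# Sitemap records apply to the whole file, not to a single group.
print(parser.site_maps())
```

Note that `urllib.robotparser` matches user agents by case-insensitive substring, so other parsers may resolve group membership slightly differently.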
