student.com
robots.txt

Robots Exclusion Standard data for student.com

Resource Scan

Scan Details

Site Domain student.com
Base Domain student.com
Scan Status Ok
Last Scan2024-06-05T09:36:54+00:00
Next Scan 2024-06-19T09:36:54+00:00

Last Scan

Scanned2024-06-05T09:36:54+00:00
URL https://student.com/robots.txt
Redirect https://www.student.com/robots.txt
Redirect Domain www.student.com
Redirect Base student.com
Domain IPs 13.251.222.126, 18.140.79.229
Redirect IPs 151.101.130.49, 151.101.194.49, 151.101.2.49, 151.101.66.49
Response IP 199.232.46.49
Found Yes
Hash 57bed4d9a39400c02f11e282a33c7c69ed0232e8d0417ec6899f94b09969dd75
SimHash 753151040e50

Groups

*

Rule Path
Allow /css
Allow /css/*?*
Allow /js
Allow /js/*?*
Allow /api/search/universities
Allow /api/universities
Allow /wp-content/*
Allow /wp-includes/*
Allow *?utm*
Disallow /set/
Disallow /reviews
Disallow /*/enquiry/*
Disallow /enquiry/*
Disallow /*/compare/*
Disallow /compare/*
Disallow /*/my-account/*
Disallow /my-account/*
Disallow /wechat-qr-code
Disallow /articles/search/*
Disallow */null
Disallow *?*

adsbot-google

Rule Path
Disallow /reviews

vegi bot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 2

semrushbot-sa

Rule Path
Disallow

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.student.com/sitemap.xml