intra.kth.se
robots.txt

Robots Exclusion Standard data for intra.kth.se

Resource Scan

Scan Details

Site Domain intra.kth.se
Base Domain kth.se
Scan Status Ok
Last Scan2025-12-31T02:07:29+00:00
Next Scan 2026-01-14T02:07:29+00:00

Last Scan

Scanned2025-12-31T02:07:29+00:00
URL https://intra.kth.se/robots.txt
Domain IPs 130.237.28.42, 2001:6b0:1:11c2::82ed:1c2a
Response IP 130.237.28.42
Found Yes
Hash 80ca48d5a6085cb3ef6ed95c0f50947211e6fa994aa725293abd8598251dd2af
SimHash 2d4fdd89ff52

Groups

*

Rule Path
Disallow /lediga-jobb/language_redirect/
Disallow /lediga-jobb/interna/
Disallow /form/
Disallow /public/
Disallow /cm/
Disallow /info/
Disallow /work/
Disallow /xyz/
Disallow /xyz
Disallow /en/xyz/
Disallow /en/xyz
Disallow /2.994/
Disallow /2.9631/
Disallow /2.1019/
Disallow /2.1166/
Disallow /2.1218/
Disallow /2.1219/
Disallow /2.36446/
Disallow /2.14566/
Disallow /2.28744/
Disallow /2.98714/
Disallow /en/2.28744/
Disallow /2.9631/
Disallow /en/2.9631/
Disallow /search
Disallow /social/user/_report_/abuse/
Disallow /gemensamt/kategorier-1.31190
Disallow /en/gemensamt/kategorier-1.31190
Disallow /social/api/profile/1.1/
Disallow /test/
Disallow /blogs/tags
Disallow /es/
Disallow /*?rss=*
Disallow /*?iCal=*
Disallow /api/icalendar/
Disallow /start/
Disallow /intranat
Disallow /sci/2.840

Other Records

Field Value
sitemap https://intra.kth.se/sitemap.xml

Comments

  • well-known resource robots.txt from 17.392