ithanse.de
robots.txt

Robots Exclusion Standard data for ithanse.de

Resource Scan

Scan Details

Site Domain ithanse.de
Base Domain ithanse.de
Scan Status Ok
Last Scan2024-06-08T15:44:18+00:00
Next Scan 2024-07-08T15:44:18+00:00

Last Scan

Scanned2024-06-08T15:44:18+00:00
URL https://ithanse.de/robots.txt
Redirect https://www.ithanse.de/robots.txt
Redirect Domain www.ithanse.de
Redirect Base ithanse.de
Domain IPs 168.119.242.134
Redirect IPs 168.119.242.134
Response IP 168.119.242.134
Found Yes
Hash 5c9e5449576c5461bc75bb642bf6105627fe3d0fdfdde1b2c0ff627546722aba
SimHash 791ddcb14408

Groups

*

Rule Path
Disallow /bewerbung
Disallow /merkliste
Disallow /feedback
Disallow /jobs/counter
Disallow /jobs/autocomplete
Disallow /apply
Disallow /datenschutz
Disallow /impressum
Disallow /agb
Disallow /widget
Disallow /auth
Disallow /auth/twitter
Disallow /auth/facebook
Disallow /auth/xing
Disallow /auth/linkedin
Disallow /job_subscriptions
Disallow /job_subscriptions/new
Disallow /arbeitgeber
Disallow /IT-jobs/search
Disallow /IT-jobs/search
Disallow /IT-jobs/search

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

auskunftbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

amazonbot

Rule Path
Disallow /

auskunftbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ithanse.de/system/sitemap.xml.gz

Warnings

  • 3 invalid lines.