ithanse.de
robots.txt
Robots Exclusion Standard data for ithanse.de
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | ithanse.de |
Base Domain | ithanse.de |
Scan Status | Ok |
Last Scan | 2024-06-08T15:44:18+00:00 |
Next Scan | 2024-07-08T15:44:18+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-08T15:44:18+00:00 |
URL | https://ithanse.de/robots.txt |
Redirect | https://www.ithanse.de/robots.txt |
Redirect Domain | www.ithanse.de |
Redirect Base | ithanse.de |
Domain IPs | 168.119.242.134 |
Redirect IPs | 168.119.242.134 |
Response IP | 168.119.242.134 |
Found | Yes |
Hash | 5c9e5449576c5461bc75bb642bf6105627fe3d0fdfdde1b2c0ff627546722aba |
SimHash | 791ddcb14408 |
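The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response body and shows how such a fingerprint could be used to detect changes between scans. The function name `content_fingerprint` and the sample bodies are hypothetical.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt body.

    Assumption: SHA-256 is used; the report only shows a 64-hex-digit
    value and does not state which algorithm produced it.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical bodies from two consecutive scans.
previous = b"User-agent: *\nDisallow: /auth\n"
current = b"User-agent: *\nDisallow: /auth\nDisallow: /apply\n"

# Differing digests indicate that the file changed between scans.
changed = content_fingerprint(previous) != content_fingerprint(current)
print("changed:", changed)
```

An exact digest only reports whether the file changed at all. The SimHash field presumably serves the complementary purpose of measuring how similar two versions are.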
Groups
*
Rule | Path |
---|---|
Disallow | /bewerbung |
Disallow | /merkliste |
Disallow | /feedback |
Disallow | /jobs/counter |
Disallow | /jobs/autocomplete |
Disallow | /apply |
Disallow | /datenschutz |
Disallow | /impressum |
Disallow | /agb |
Disallow | /widget |
Disallow | /auth |
Disallow | /auth/twitter |
Disallow | /auth/facebook |
Disallow | /auth/xing |
Disallow | /auth/linkedin |
Disallow | /job_subscriptions |
Disallow | /job_subscriptions/new |
Disallow | /arbeitgeber |
Disallow | /IT-jobs/search |
Disallow | /IT-jobs/search |
Disallow | /IT-jobs/search |
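As a rough illustration of how a crawler might apply the rules listed above, the sketch below rebuilds a robots.txt group from the table and queries it with Python's standard `urllib.robotparser`. The rebuilt file is an assumption: the report does not show the original text, including the 3 lines flagged as invalid. The agent name `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the rule table above (the repeated entry is listed once).
DISALLOWED = [
    "/bewerbung", "/merkliste", "/feedback", "/jobs/counter",
    "/jobs/autocomplete", "/apply", "/datenschutz", "/impressum",
    "/agb", "/widget", "/auth", "/auth/twitter", "/auth/facebook",
    "/auth/xing", "/auth/linkedin", "/job_subscriptions",
    "/job_subscriptions/new", "/arbeitgeber", "/IT-jobs/search",
]

# Assumption: a reconstruction of the "*" group, not the site's literal file.
lines = ["User-agent: *"] + [f"Disallow: {path}" for path in DISALLOWED]

parser = RobotFileParser()
parser.parse(lines)


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Check a site-relative path against the reconstructed rules."""
    return parser.can_fetch(agent, "https://www.ithanse.de" + path)


# Hypothetical sample paths.
for path in ["/", "/jobs/123", "/auth/twitter", "/IT-jobs/search?q=python"]:
    print(path, is_allowed(path))
```

Because `urllib.robotparser` matches rules by path prefix, the `/auth` rule already covers `/auth/twitter` and the other `/auth/...` entries, and `/job_subscriptions` covers `/job_subscriptions/new`.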
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
sitemap | https://www.ithanse.de/system/sitemap.xml.gz |
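The crawl-delay and sitemap records can be read with the same standard-library parser. The sketch below assumes the `Crawl-delay` line belongs to the `*` group, which the report does not confirm, and uses a shortened, reconstructed file. `ExampleBot` is a hypothetical agent name, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: shortened reconstruction; the placement of Crawl-delay inside
# the "*" group is a guess, since the report lists it separately.
ROBOTS_TXT = """\
User-agent: *
Disallow: /apply
Crawl-delay: 10

Sitemap: https://www.ithanse.de/system/sitemap.xml.gz
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Seconds to wait between requests, or None if no delay applies to the agent.
delay = parser.crawl_delay("ExampleBot")

# List of sitemap URLs; site_maps() returns None when there are none.
sitemaps = parser.site_maps() or []

print("crawl delay:", delay)
print("sitemaps:", sitemaps)
```

A polite crawler would sleep for `delay` seconds between requests to the host and could use the sitemap URL as its starting point for discovering pages.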
Warnings
- 3 invalid lines.