restaurant.info
robots.txt

Robots Exclusion Standard data for restaurant.info

Resource Scan

Scan Details

Site Domain restaurant.info
Base Domain restaurant.info
Scan Status Ok
Last Scan2024-06-08T01:55:29+00:00
Next Scan 2024-06-15T01:55:29+00:00

Last Scan

Scanned2024-06-08T01:55:29+00:00
URL https://restaurant.info/robots.txt
Domain IPs 136.243.168.91
Response IP 136.243.168.91
Found Yes
Hash bf2bb8b7547f96825aea71672cdf1c781741de118bcb6a34dd18427ab437ad05
SimHash b030a0422d91

Groups

googlebot-image

Rule Path
Disallow /img-embedded/

*

Rule Path
Disallow /manage
Disallow /admin
Disallow /api/
Disallow /buchen/
Disallow /styleguide
Disallow /content-management-guide
Disallow /*-favoriten
Disallow /*-vergleichen
Disallow /*-verbessern/
Disallow /Discoverize.Pictures/UserPictureUpload/GetForm*
Disallow /Discoverize.Entry/Entry/GetAvailability
Disallow /Discoverize.Entry/Entry/SubmitRating
Disallow /bewertungs-widget/*
Disallow /tn/Pages/Page*
Disallow /TrackEntryPageView/
Disallow /TrackHomepageClick/
Disallow /TrackShowPhoneClick/
Disallow /TrackPhoneLinkClick/
Disallow /TrackCustomCallToActionClick/
Disallow /pressespiegel

Other Records

Field Value
sitemap https://Restaurant.Info/sitemap.xml

Comments

  • don't disallow /Track because we might have search or entry pages with that prefix
  • requested by Thematica, probably no other pages will be impacted