intersana.de
robots.txt

Robots Exclusion Standard data for intersana.de

Resource Scan

Scan Details

Site Domain intersana.de
Base Domain intersana.de
Scan Status Ok
Last Scan 2024-09-19T00:27:11+00:00
Next Scan 2024-09-26T00:27:11+00:00

Last Scan

Scanned 2024-09-19T00:27:11+00:00
URL https://intersana.de/robots.txt
Redirect https://www.intersana.de/robots.txt
Redirect Domain www.intersana.de
Redirect Base intersana.de
Domain IPs 213.182.15.150
Redirect IPs 213.182.15.150
Response IP 213.182.15.150
Found Yes
Hash c9ef95623fe09c14f31f6de00f0425ade7bfa7cd3966cdb3b0cf35d64962e2f2
SimHash b2227554edb5

Groups

*

Rule Path
Disallow /cms_addon
Disallow /cms_docs
Disallow /redFACT
Disallow /REST/frontend/itemstatistics

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.intersana.de/sitemap-index_1-Google_Sitemap.xml
sitemap https://www.intersana.de/sitemap-index_3-Google_News_Sitemap.xml
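
The groups and sitemap records above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from the scanned rules, not the original file (the real ordering, casing, and comments may differ). `ExampleBot` is a hypothetical user agent, and `is_allowed` is a helper name introduced here for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan data above; NOT a verbatim copy of the
# live robots.txt (ordering, casing, and comments are assumptions).
ROBOTS_TXT = """\
User-agent: *
Disallow: /cms_addon
Disallow: /cms_docs
Disallow: /redFACT
Disallow: /REST/frontend/itemstatistics

User-agent: GPTBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: Google-Extended
Disallow: /

Sitemap: https://www.intersana.de/sitemap-index_1-Google_Sitemap.xml
Sitemap: https://www.intersana.de/sitemap-index_3-Google_News_Sitemap.xml
"""

BASE = "https://www.intersana.de"

# Parse the reconstructed text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if the parsed rules let `user_agent` fetch `path`."""
    return parser.can_fetch(user_agent, BASE + path)


if __name__ == "__main__":
    # "ExampleBot" is a hypothetical crawler that falls under the "*" group.
    checks = [
        ("ExampleBot", "/"),
        ("ExampleBot", "/cms_docs/page"),
        ("GPTBot", "/"),
        ("Google-Extended", "/"),
    ]
    for agent, path in checks:
        verdict = "allowed" if is_allowed(agent, path) else "disallowed"
        print(f"{agent:16} {path:16} {verdict}")

    # site_maps() requires Python 3.8 or newer.
    for sitemap_url in parser.site_maps() or []:
        print("sitemap:", sitemap_url)
```

The standard-library parser matches paths by case-sensitive prefix and user agents by case-insensitive substring. Individual crawlers may interpret the same rules somewhat differently, so this is an approximation of their behaviour, not a guarantee.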

Comments

  • global live settings :
  • Legal notice: intersana.de expressly reserves the right to use its content for commercial text and data mining (§44b UrhG).
  • The use of robots or other automated means to access intersana.de or collect or mine data without the express permission of intersana.de is strictly prohibited.
  • If you would like to apply for permission to crawl all-in.de, collect or use data, please contact info@intersana.de