cavallo.de
robots.txt

Robots Exclusion Standard data for cavallo.de

Resource Scan

Scan Details

Site Domain cavallo.de
Base Domain cavallo.de
Scan Status Ok
Last Scan 2024-06-30T21:08:54+00:00
Next Scan 2024-07-07T21:08:54+00:00

Last Scan

Scanned 2024-06-30T21:08:54+00:00
URL https://cavallo.de/robots.txt
Redirect https://www.cavallo.de/robots.txt
Redirect Domain www.cavallo.de
Redirect Base cavallo.de
Domain IPs 2a01:138:a027:0:e::236, 62.146.96.236
Redirect IPs 2a01:138:a027:0:e::236, 62.146.96.236
Response IP 62.146.96.236
Found Yes
Hash 3959593d0116b50021b3f5f3449e27b047f148a015dc711eb9d8fb559b1eabc2
SimHash 223277186db3
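
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a fetched robots.txt body could be fingerprinted as follows so that two scans can be compared for changes. The function name `fingerprint` is hypothetical, and the scanner may normalise the body differently before hashing.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash
```

A byte-exact digest changes on any edit, including whitespace, whereas a similarity hash such as the SimHash field is intended to stay close for near-identical documents.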

Groups

*

Rule Path
Disallow /irapi/*
Disallow /irelements/*
Disallow /suche/*
Disallow /heft/*
Disallow *ePaper*.pdf

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
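
The groups above can be evaluated with a small matcher. The following is a minimal sketch, not the scanner's implementation: it assumes the common wildcard semantics in which `*` matches any character sequence, a trailing `$` anchors the end of the path, and every pattern is otherwise a prefix match. It also simplifies group selection to an exact, case-insensitive user-agent lookup with a fallback to `*`. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Groups as listed in the scan above (user-agent -> disallowed path patterns).
GROUPS = {
    "*": ["/irapi/*", "/irelements/*", "/suche/*", "/heft/*", "*ePaper*.pdf"],
    "google-extended": ["/"],
    "gptbot": ["/"],
    "ccbot": ["/"],
}

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(user_agent: str, path: str) -> bool:
    """Return False if any Disallow pattern of the selected group matches the path."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return not any(pattern_to_regex(p).match(path) for p in rules)
```

For example, `is_allowed("GPTBot", "/reiten/")` returns `False` because that group disallows `/`, while `is_allowed("SomeBot", "/reiten/")` returns `True` because none of the `*` group patterns match. Since the scan lists only Disallow rules, the sketch omits the longest-match precedence that would be needed once Allow rules are present.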

Other Records

Field Value
sitemap https://www.cavallo.de/sitemap/cav-sitemap-news.xml
sitemap https://www.cavallo.de/sitemap/cav-sitemap-index.xml
sitemap https://www.cavallo.de/sitemap/cav-sitemap-navigation.xml
sitemap https://www.cavallo.de/sitemap/cav-sitemap-footer.xml
sitemap https://www.cavallo.de/sitemap/cav-sitemap-themenseiten.xml
sitemap https://www.cavallo.de/sitemap/cav-video-sitemap-index.xml
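
Sitemap records are independent of the user-agent groups and can be collected in a single pass over the file. A minimal sketch, assuming the raw robots.txt text is already available as a string; the function name `extract_sitemaps` is hypothetical, and comment handling is left out.

```python
def extract_sitemaps(robots_txt: str) -> list[str]:
    """Collect the values of all 'Sitemap:' records, matching the field case-insensitively."""
    urls = []
    for line in robots_txt.splitlines():
        # Split on the first colon only, so the URL's own 'https:' stays intact.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls
```

Called on a file containing `Sitemap: https://www.cavallo.de/sitemap/cav-sitemap-index.xml`, the function returns a list holding that URL.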

Comments

  • Allow sitemap
  • disallow api calls
  • Legal notice: [https://www.cavallo.de/] expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access [https://www.cavallo.de/] or collect or mine data without the express permission of [https://www.cavallo.de/] is strictly prohibited.
  • If you would like to apply for permission to crawl [https://www.cavallo.de/], collect or use data, please contact [ompi@motorpresse.de]