svz.de
robots.txt

Robots Exclusion Standard data for svz.de

Resource Scan

Scan Details

Site Domain svz.de
Base Domain svz.de
Scan Status Ok
Last Scan2024-05-11T15:13:15+00:00
Next Scan 2024-05-18T15:13:15+00:00

Last Scan

Scanned2024-05-11T15:13:15+00:00
URL https://svz.de/robots.txt
Redirect https://www.svz.de/robots.txt
Redirect Domain www.svz.de
Redirect Base svz.de
Domain IPs 18.159.179.202, 18.193.59.2, 3.127.34.154
Redirect IPs 18.155.202.121, 18.155.202.29, 18.155.202.81, 18.155.202.97, 2600:9000:200f:2800:9:2bfd:f680:93a1, 2600:9000:200f:4200:9:2bfd:f680:93a1, 2600:9000:200f:4a00:9:2bfd:f680:93a1, 2600:9000:200f:5000:9:2bfd:f680:93a1, 2600:9000:200f:8200:9:2bfd:f680:93a1, 2600:9000:200f:a200:9:2bfd:f680:93a1, 2600:9000:200f:be00:9:2bfd:f680:93a1, 2600:9000:200f:c600:9:2bfd:f680:93a1
Response IP 18.165.171.111
Found Yes
Hash 84dcfe0ce17ea7d513e5c77822c07c638a6f3dea8033cda44792f02d246c0e37
SimHash 2a305d006df1

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /pagebuilding/
Disallow /new-articles/
Disallow /files/
Disallow /cre-1.0/tracking/
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.svz.de/sitemap.xml
sitemap https://www.svz.de/sitemap/googleNewsList.xml
sitemap https://www.svz.de/sitemap/artikel/sitemap-current.xml

Comments

  • Legal notice: svz.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access svz.de or collect or mine data without the express permission of svz.de is strictly prohibited.
  • If you would like to apply for permission to crawl svz.de, collect or use data, please contact info+nutzungsrecht@noz-digital.de