ahgz.de
robots.txt

Robots Exclusion Standard data for ahgz.de

Resource Scan

Scan Details

Site Domain ahgz.de
Base Domain ahgz.de
Scan Status Ok
Last Scan2024-06-13T22:17:59+00:00
Next Scan 2024-06-20T22:17:59+00:00

Last Scan

Scanned2024-06-13T22:17:59+00:00
URL https://ahgz.de/robots.txt
Redirect https://www.ahgz.de/robots.txt
Redirect Domain www.ahgz.de
Redirect Base ahgz.de
Domain IPs 185.233.189.103
Redirect IPs 185.233.189.103
Response IP 185.233.189.103
Found Yes
Hash 01ab5b49ade949fd6798bb7b99eec173ed2b89af42e605302faa114af9457047
SimHash bb277d1869f7

Groups

*

Rule Path
Disallow /suche/?OK*
Disallow /user/
Disallow /stats/c/
Disallow */admin/
Disallow *?login*
Disallow *?thankyou*

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ahgz.de/sitemap.xml
sitemap https://www.ahgz.de/sitemap_news.xml
sitemap https://www.ahgz.de/sitemap_image.xml

Comments

  • Legal notice: Deutscher Fachverlag GmbH expressly reserves the right to use all content available under [ahgz.de] for commercial text and data mining (ยง 44b UrhG).
  • The use of robots or other automated means to access [ahgz.de] or collect or mine data without the express permission of Deutscher Fachverlag GmbH is strictly prohibited.
  • If you would like to apply for permission to crawl [ahgz.de], collect or use data for commercial text and data mining, please contact content-syndication@dfv.de.