t.neuepresse.de
robots.txt

Robots Exclusion Standard data for t.neuepresse.de

Resource Scan

Scan Details

Site Domain	t.neuepresse.de
Base Domain	neuepresse.de
Scan Status	Ok
Last Scan	2024-11-10T22:06:59+00:00
Next Scan	2024-12-10T22:06:59+00:00

Last Scan

Scanned	2024-11-10T22:06:59+00:00
URL	https://t.neuepresse.de/robots.txt
Redirect	https://www.neuepresse.de/robots.txt
Redirect Domain	www.neuepresse.de
Redirect Base	neuepresse.de
Domain IPs	193.30.60.245
Redirect IPs	184.87.193.148, 184.87.193.157, 2600:1413:b000:13::b857:c194, 2600:1413:b000:13::b857:c19d
Response IP	23.45.207.178
Found	Yes
Hash	109c4306f00b516668b79c4397b4605aaf2a4bc4e32b785494e18bc8bf7ed648
SimHash	a334176c89b1

Groups

*

Rule	Path
Disallow	/disabledFunctionsForCrawlers.chunk.js
Disallow	/mandanten/
Disallow	/mediabox/
Disallow	/politik/politik-extern/
Disallow	/wirtschaft/wirtschaft-extern
Disallow	/suche/
Disallow	/ellipsis-preview/
Disallow	/pf/api/v3/
Disallow	/zeitung/
Disallow	/metaseiten/
Disallow	/var/storage
Disallow	/var/storage/*
Disallow	/bundles/
Disallow	/cms/
Disallow	/security/
Disallow	/newsletter/abmeldung/
Disallow	/angebot/
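
The paths in the * group are prefix patterns, so a rule such as /wirtschaft/wirtschaft-extern (no trailing slash) also covers longer paths that merely start with that string, and /var/storage/* adds nothing beyond /var/storage. The following is a minimal sketch of that matching, assuming the wildcard semantics of RFC 9309 ("*" matches any character sequence, a trailing "$" anchors the end). The names DISALLOWED, rule_to_regex, and is_disallowed are hypothetical helpers for illustration, not part of the scanned site or of any library.

```python
import re

# Disallow paths of the "*" group, copied from the table above.
DISALLOWED = [
    "/disabledFunctionsForCrawlers.chunk.js",
    "/mandanten/",
    "/mediabox/",
    "/politik/politik-extern/",
    "/wirtschaft/wirtschaft-extern",
    "/suche/",
    "/ellipsis-preview/",
    "/pf/api/v3/",
    "/zeitung/",
    "/metaseiten/",
    "/var/storage",
    "/var/storage/*",
    "/bundles/",
    "/cms/",
    "/security/",
    "/newsletter/abmeldung/",
    "/angebot/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a regex anchored at the start."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal parts; each "*" becomes "match anything".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """True if any Disallow rule matches the beginning of the path.

    The group contains no Allow rules, so a single match is enough.
    """
    return any(rule_to_regex(rule).match(path) for rule in DISALLOWED)


if __name__ == "__main__":
    for sample in ["/suche/?q=test", "/politik/", "/var/storage/images/a.jpg"]:
        print(sample, "blocked" if is_disallowed(sample) else "allowed")
```

Compiling the patterns on every call keeps the sketch short; a real crawler would compile them once.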

gptbot

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/
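
A crawler applies the group that names it and falls back to the * group only when no named group matches. The sketch below illustrates this with Python's standard urllib.robotparser. It feeds the parser a robots.txt text reconstructed from the groups listed in this report; the original file's exact layout and ordering are not shown here, so the reconstruction is an assumption. Only three of the * paths are included for brevity, and the names WILDCARD_PATHS, BLOCKED_AGENTS, build_robots_txt, allowed, and the agent "ExampleBot" are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Subset of the "*" group paths (assumption: shortened for illustration).
WILDCARD_PATHS = ["/suche/", "/zeitung/", "/cms/"]

# Agents that this report lists with a blanket "Disallow: /".
BLOCKED_AGENTS = ["gptbot", "ccbot", "chatgpt-user", "bytespider", "claudebot"]

# Host the robots.txt request was redirected to, per the scan details.
BASE = "https://www.neuepresse.de"


def build_robots_txt() -> str:
    """Rebuild a robots.txt text from the groups shown in the report."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in WILDCARD_PATHS]
    for agent in BLOCKED_AGENTS:
        lines += ["", f"User-agent: {agent}", "Disallow: /"]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())


def allowed(agent: str, path: str) -> bool:
    """Ask the parser whether the agent may fetch the path on BASE."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent in ["gptbot", "ExampleBot"]:
        for path in ["/politik/", "/suche/?q=x"]:
            print(agent, path, allowed(agent, path))
```

Note that urllib.robotparser matches paths by plain prefix and does not interpret "*" inside a path, which is why the sketch leaves out /var/storage/*.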

Comments

Legal notice: neuepresse.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
The use of robots or other automated means to access neuepresse.de or collect or mine data without the express permission of neuepresse.de is strictly prohibited.
If you would like to apply for permission to crawl neuepresse.de, collect or use data, please contact lizenzen@rnd.de
