wz-net.de
robots.txt

Robots Exclusion Standard data for wz-net.de

Archived Snapshots

Resource Scan

Scan Details

Site Domain	wz-net.de
Base Domain	wz-net.de
Scan Status	Ok
Last Scan	2024-05-26T18:20:45+00:00
Next Scan	2024-06-02T18:20:45+00:00

Last Scan

Scanned	2024-05-26T18:20:45+00:00
URL	https://wz-net.de/robots.txt
Domain IPs	5.9.164.117
Response IP	5.9.164.117
Found	Yes
Hash	f7d269f4bb31ec91e6e9f7220ea0359579e14a2f09557ade866a61ac4a6c5caa
SimHash	ba141d084f74

Groups

*

Rule	Path
Disallow	/includes/
Disallow	/misc/
Disallow	/modules/
Disallow	/profiles/
Disallow	/scripts/
Disallow	/themes/
Disallow	/CHANGELOG.txt
Disallow	/cron.php
Disallow	/INSTALL.mysql.txt
Disallow	/INSTALL.pgsql.txt
Disallow	/install.php
Disallow	/INSTALL.txt
Disallow	/LICENSE.txt
Disallow	/MAINTAINERS.txt
Disallow	/update.php
Disallow	/UPGRADE.txt
Disallow	/xmlrpc.php
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips/
Disallow	/logout/
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register/
Disallow	/user/password/
Disallow	/user/login/
Disallow	/?q=admin%2F
Disallow	/?q=comment%2Freply%2F
Disallow	/?q=filter%2Ftips%2F
Disallow	/?q=logout%2F
Disallow	/?q=node%2Fadd%2F
Disallow	/?q=search%2F
Disallow	/?q=user%2Fpassword%2F
Disallow	/?q=user%2Fregister%2F
Disallow	/?q=user%2Flogin%2F
Disallow	/?_ptid=
Disallow	/sites/all/modules/ad/serve.php
Disallow	/?page=
Disallow	/?destination=
Disallow	/?vtd=
Disallow	/search/content/*
Disallow	/adtest/*
Disallow	/contact/*
Disallow	/user/*

Rule

Path

Disallow

/includes/

Disallow

/misc/

Disallow

/modules/

Disallow

/profiles/

Disallow

/scripts/

Disallow

/themes/

Disallow

/CHANGELOG.txt

Disallow

/cron.php

Disallow

/INSTALL.mysql.txt

Disallow

/INSTALL.pgsql.txt

Disallow

/install.php

Disallow

/INSTALL.txt

Disallow

/LICENSE.txt

Disallow

/MAINTAINERS.txt

Disallow

/update.php

Disallow

/UPGRADE.txt

Disallow

/xmlrpc.php

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/filter/tips/

Disallow

/logout/

Disallow

/node/add/

Disallow

/search/

Disallow

/user/register/

Disallow

/user/password/

Disallow

/user/login/

Disallow

/?q=admin%2F

Disallow

/?q=comment%2Freply%2F

Disallow

/?q=filter%2Ftips%2F

Disallow

/?q=logout%2F

Disallow

/?q=node%2Fadd%2F

Disallow

/?q=search%2F

Disallow

/?q=user%2Fpassword%2F

Disallow

/?q=user%2Fregister%2F

Disallow

/?q=user%2Flogin%2F

Disallow

/*?_ptid=*

Disallow

/sites/all/modules/ad/serve.php

Disallow

/*?page=*

Disallow

/*?destination=*

Disallow

/*?vtd=*

Disallow

/search/content/*

Disallow

/adtest/*

Disallow

/contact/*

Disallow

/user/*

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bingbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

Legal notice: nq-online.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
The use of robots or other automated means to access nq-online.de or collect or mine data without the express permission of nq-online.de is strictly prohibited.
If you would like to apply for permission to crawl nq-online.de, collect or use data, please contact webmaster@neckarquelle.de
robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
For syntax checking, see:
http://www.frobee.com/robots-txt-check
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)

wz-net.derobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

chatgpt-user

gptbot

ccbot

bingbot

google-extended

Comments

wz-net.de
robots.txt