hiphop.de
robots.txt

Robots Exclusion Standard data for hiphop.de

Archived Snapshots

Resource Scan

Scan Details

Site Domain	hiphop.de
Base Domain	hiphop.de
Scan Status	Ok
Last Scan	2024-10-27T20:35:39+00:00
Next Scan	2024-11-03T20:35:39+00:00

Last Scan

Scanned	2024-10-27T20:35:39+00:00
URL	https://hiphop.de/robots.txt
Domain IPs	104.26.6.227, 104.26.7.227, 172.67.70.88, 2606:4700:20::681a:6e3, 2606:4700:20::681a:7e3, 2606:4700:20::ac43:4658
Response IP	104.26.7.227
Found	Yes
Hash	a4c1b42454442cddc5d96941f5a3813f332c5f988f2da71d52695ed807793bdd
SimHash	b816bd4b8760

Groups

*

Rule	Path
Allow	/core/*.css$
Allow	/core/*.css?
Allow	/core/*.js$
Allow	/core/*.js?
Allow	/core/*.svg
Allow	/profiles/*.css$
Allow	/profiles/*.css?
Allow	/profiles/*.js$
Allow	/profiles/*.js?
Allow	/profiles/*.svg
Disallow	/core/
Disallow	/profiles/
Disallow	/README.txt
Disallow	/web.config
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register/
Disallow	/user/password/
Disallow	/user/login/
Disallow	/user/logout/
Disallow	/index.php/admin/
Disallow	/index.php/comment/reply/
Disallow	/index.php/filter/tips
Disallow	/index.php/node/add/
Disallow	/index.php/search/
Disallow	/index.php/user/password/
Disallow	/index.php/user/register/
Disallow	/index.php/user/login/
Disallow	/index.php/user/logout/
Disallow	/sites/default/files/BAK_news_articles
Disallow	/sites/default/files/styles/article_header_overlay_large/public/BAK_news_articles

Rule

Path

Allow

/core/*.css$

Allow

/core/*.css?

Allow

/core/*.js$

Allow

/core/*.js?

Allow

/core/*.svg

Allow

/profiles/*.css$

Allow

/profiles/*.css?

Allow

/profiles/*.js$

Allow

/profiles/*.js?

Allow

/profiles/*.svg

Disallow

/core/

Disallow

/profiles/

Disallow

/README.txt

Disallow

/web.config

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/filter/tips

Disallow

/node/add/

Disallow

/search/

Disallow

/user/register/

Disallow

/user/password/

Disallow

/user/login/

Disallow

/user/logout/

Disallow

/index.php/admin/

Disallow

/index.php/comment/reply/

Disallow

/index.php/filter/tips

Disallow

/index.php/node/add/

Disallow

/index.php/search/

Disallow

/index.php/user/password/

Disallow

/index.php/user/register/

Disallow

/index.php/user/login/

Disallow

/index.php/user/logout/

Disallow

/sites/default/files/BAK_news_articles

Disallow

/sites/default/files/styles/article_header_overlay_large/public/BAK_news_articles

ia_archiver

Rule	Path
Disallow	/sites/default/files/

Rule

Path

Disallow

/sites/default/files/

archive.org_bot

Rule	Path
Disallow	/sites/default/files/

Rule

Path

Disallow

/sites/default/files/

facebookexternalhit

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
CSS, JS, Images
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
Hide Images
User-agent: Googlebot-Image
Disallow: /sites/default/files/
Facebook 403 Error Fix

Back to top

hiphop.derobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

ia_archiver

archive.org_bot

facebookexternalhit

Comments

hiphop.de
robots.txt