jyu.fi
robots.txt

Robots Exclusion Standard data for jyu.fi

Resource Scan

Scan Details

Site Domain	jyu.fi
Base Domain	jyu.fi
Scan Status	Ok
Last Scan	2024-10-20T19:38:03+00:00
Next Scan	2024-11-19T19:38:03+00:00
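
The two timestamps above are exactly 30 days apart, which suggests (the report does not state it) a fixed 30-day rescan interval. A minimal sketch of that arithmetic, where `RESCAN_INTERVAL` and `next_scan` are hypothetical names and the interval is inferred:

```python
from datetime import datetime, timedelta

# Assumption: a 30-day interval, inferred from the Last Scan / Next Scan values.
RESCAN_INTERVAL = timedelta(days=30)


def next_scan(last_scan_iso: str) -> datetime:
    """Return the next scan time for an ISO 8601 timestamp with a UTC offset."""
    return datetime.fromisoformat(last_scan_iso) + RESCAN_INTERVAL


print(next_scan("2024-10-20T19:38:03+00:00").isoformat())
# prints 2024-11-19T19:38:03+00:00
```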

Last Scan

Scanned	2024-10-20T19:38:03+00:00
URL	https://jyu.fi/robots.txt
Redirect	https://www.jyu.fi/robots.txt
Redirect Domain	www.jyu.fi
Redirect Base	jyu.fi
Domain IPs	130.234.6.163
Redirect IPs	130.234.6.163
Response IP	130.234.6.163
Found	Yes
Hash	c6e0a07946ed25b38121c1036b5ac548c5d1332e2a1c23d8dec2552b10a16c25
SimHash	ac14bf514d60
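
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256, a scanner could detect a changed robots.txt between scans roughly as follows. The function names are hypothetical and the fetched body is passed in as bytes.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare the digest of a freshly fetched body with the stored one."""
    return fingerprint(body) != previous_hash


stored = fingerprint(b"User-agent: *\nDisallow: /admin/\n")
print(has_changed(stored, b"User-agent: *\nDisallow: /admin/\n"))  # same body
print(has_changed(stored, b"User-agent: *\nDisallow: /\n"))        # edited body
```

An exact hash only says whether the file changed at all. The separate SimHash field presumably serves as a similarity measure, but its construction is not described in the report.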

Groups

*

Rule	Path
Allow	/core/*.css$
Allow	/core/*.css?
Allow	/core/*.js$
Allow	/core/*.js?
Allow	/core/*.gif
Allow	/core/*.jpg
Allow	/core/*.jpeg
Allow	/core/*.png
Allow	/core/*.svg
Allow	/profiles/*.css$
Allow	/profiles/*.css?
Allow	/profiles/*.js$
Allow	/profiles/*.js?
Allow	/profiles/*.gif
Allow	/profiles/*.jpg
Allow	/profiles/*.jpeg
Allow	/profiles/*.png
Allow	/profiles/*.svg
Disallow	/core/
Disallow	/profiles/
Disallow	/README.txt
Disallow	/web.config
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register/
Disallow	/user/password/
Disallow	/user/login/
Disallow	/user/logout/
Disallow	/index.php/admin/
Disallow	/index.php/comment/reply/
Disallow	/index.php/filter/tips
Disallow	/index.php/node/add/
Disallow	/index.php/search/
Disallow	/index.php/user/password/
Disallow	/index.php/user/register/
Disallow	/index.php/user/login/
Disallow	/index.php/user/logout/
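
The Allow rules above only have an effect because of how they interact with the broader `Disallow /core/` and `Disallow /profiles/` rules. Under RFC 9309, the longest matching pattern wins and Allow wins a tie, with `*` matching any run of characters and a trailing `$` anchoring the end of the path. The sketch below illustrates that precedence on a subset of the rules. `pattern_to_regex` and `is_allowed` are hypothetical helpers, not any particular crawler's implementation.

```python
import re

# A subset of the "*" group rules listed above.
RULES = [
    ("Allow", "/core/*.css$"),
    ("Allow", "/core/*.js$"),
    ("Allow", "/core/*.js?"),
    ("Disallow", "/core/"),
    ("Disallow", "/admin/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regular expression."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" is the only wildcard; everything else (including "?") is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "Allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


print(is_allowed("/core/misc/drupal.js"))      # True: the Allow pattern is longer
print(is_allowed("/core/misc/drupal.js?v=1"))  # True: matches /core/*.js?
print(is_allowed("/core/install.php"))         # False: only Disallow /core/ matches
```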

*

Rule	Path
Disallow	(empty path: nothing is disallowed)

discordbot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

mauibot

Rule	Path
Disallow	/

qwantify

Rule	Path
Disallow	/

zumbot

Rule	Path
Disallow	/

arachni

Rule	Path
Disallow	/

velenpublicwebcrawler

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

yandexbot

Rule	Path
Disallow	/

mediatoolkitbot

Rule	Path
Disallow	/

googlebot

Rule	Path
Disallow	/*?
Disallow	/*atct_album_view$
Disallow	/*folder_factories$
Disallow	/*folder_summary_view$
Disallow	/*login_form$
Disallow	/*mail_password_form$
Disallow	/*%40%40search$
Disallow	/*/search$
Disallow	/*search_rss$
Disallow	/*sendto_form$
Disallow	/*summary_view$
Disallow	/*thumbnail_view$
Disallow	/*/view$
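
Because the file contains a dedicated googlebot group, a crawler that follows RFC 9309 applies only that group to Googlebot and ignores the `*` groups for it. Groups that share a user-agent, such as the two `*` groups here, are merged. A minimal sketch of that selection step, using an abbreviated copy of the groups above; `GROUPS` and `select_rules` are hypothetical names:

```python
# Abbreviated copy of the groups in this report, in file order.
GROUPS = [
    ("*", [("Allow", "/core/*.css$"), ("Disallow", "/core/"), ("Disallow", "/admin/")]),
    ("*", [("Disallow", "")]),
    ("ccbot", [("Disallow", "/")]),
    ("googlebot", [("Disallow", "/*?"), ("Disallow", "/*/view$")]),
]


def select_rules(product_token: str, groups=GROUPS):
    """Return the merged rules of the groups that apply to a crawler.

    Named groups are matched case-insensitively; the "*" groups are used
    only when no named group matches.
    """
    token = product_token.lower()
    named = [rules for name, rules in groups if name.lower() == token]
    chosen = named or [rules for name, rules in groups if name == "*"]
    return [rule for rules in chosen for rule in rules]


print(select_rules("Googlebot"))  # only the googlebot rules
print(select_rules("CCBot"))      # [('Disallow', '/')]
print(select_rules("bingbot"))    # both "*" groups, merged
```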

Other Records

Field	Value
sitemap	https://www.jyu.fi/sitemap.xml
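
The sitemap record can be read with Python's standard `urllib.robotparser`. The sketch below parses a short, hand-written excerpt rather than fetching the live file; the excerpt is an assumption about how these records appear in the raw robots.txt. Note that `urllib.robotparser` does plain prefix matching and does not interpret the `*` and `$` wildcards used elsewhere in this file.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt modelled on the records in this report.
EXCERPT = [
    "User-agent: *",
    "Disallow: /admin/",
    "Sitemap: https://www.jyu.fi/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(EXCERPT)

print(parser.site_maps())  # list of Sitemap URLs (Python 3.8+)
print(parser.can_fetch("*", "https://www.jyu.fi/admin/settings"))
```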

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
Sitemap
CSS, JS, Images
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
Define access-restrictions for robots/spiders
http://www.robotstxt.org/wc/norobots.html
By default we allow robots to access all areas of our site
already accessible to anonymous users
Add Googlebot-specific syntax extension to exclude forms
that are repeated for each piece of content in the site
the wildcard is only supported by Googlebot
http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling
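
The comment about the file being ignored unless it sits at the root of the host can be made concrete: for any page URL, the only robots.txt location a crawler consults is `/robots.txt` on the same scheme and host. A small sketch, with `robots_url` as a hypothetical helper name:

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; drop the path, query string, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_url("http://example.com/site/page.html?x=1"))
# prints http://example.com/robots.txt
```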

Warnings

2 invalid lines.
