comune.bergamo.it
robots.txt

Robots Exclusion Standard data for comune.bergamo.it

Archived Snapshots

Resource Scan

Scan Details

Site Domain	comune.bergamo.it
Base Domain	comune.bergamo.it
Scan Status	Ok
Last Scan	2026-02-09T04:57:00+00:00
Next Scan	2026-03-11T04:57:00+00:00

Last Scan

Scanned	2026-02-09T04:57:00+00:00
URL	https://comune.bergamo.it/robots.txt
Redirect	https://www.comune.bergamo.it/robots.txt
Redirect Domain	www.comune.bergamo.it
Redirect Base	comune.bergamo.it
Domain IPs	34.154.172.198
Redirect IPs	80.211.185.159
Response IP	80.211.185.159
Found	Yes
Hash	6bfc660732757141ad0734e8443e73be436d875fc0f4f0633e3998e2525a3b2b
SimHash	3996bd0be768

Groups

*

Rule	Path
Allow	/core/*.css$
Allow	/core/*.css?
Allow	/core/*.js$
Allow	/core/*.js?
Allow	/core/*.gif
Allow	/core/*.jpg
Allow	/core/*.jpeg
Allow	/core/*.png
Allow	/core/*.svg
Allow	/profiles/*.css$
Allow	/profiles/*.css?
Allow	/profiles/*.js$
Allow	/profiles/*.js?
Allow	/profiles/*.gif
Allow	/profiles/*.jpg
Allow	/profiles/*.jpeg
Allow	/profiles/*.png
Allow	/profiles/*.svg
Disallow	/core/
Disallow	/profiles/
Disallow	/README.txt
Disallow	/web.config
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register
Disallow	/user/password
Disallow	/user/login
Disallow	/user/logout
Disallow	/media/oembed
Disallow	/*/media/oembed
Disallow	/index.php/admin/
Disallow	/index.php/comment/reply/
Disallow	/index.php/filter/tips
Disallow	/index.php/node/add/
Disallow	/index.php/search/
Disallow	/index.php/user/password
Disallow	/index.php/user/register
Disallow	/index.php/user/login
Disallow	/index.php/user/logout
Disallow	/index.php/media/oembed
Disallow	/index.php/*/media/oembed
Disallow	/api-doc
Disallow	/api-doc-ui
Disallow	/ticket_assistenza
Disallow	/form/trasparenza-titolari-incarichi-politici
Disallow	/form/trasparenza-titolari-incarichi-amministrativi-di-vertice
Disallow	/form/trasparenza-titolari-incarichi-dirigenziali
Disallow	/form/trasparenza-posizioni-organizzative
Disallow	/modulo-iscrizione-evento-pa-digitale-2026

Rule

Path

Allow

/core/*.css$

Allow

/core/*.css?

Allow

/core/*.js$

Allow

/core/*.js?

Allow

/core/*.gif

Allow

/core/*.jpg

Allow

/core/*.jpeg

Allow

/core/*.png

Allow

/core/*.svg

Allow

/profiles/*.css$

Allow

/profiles/*.css?

Allow

/profiles/*.js$

Allow

/profiles/*.js?

Allow

/profiles/*.gif

Allow

/profiles/*.jpg

Allow

/profiles/*.jpeg

Allow

/profiles/*.png

Allow

/profiles/*.svg

Disallow

/core/

Disallow

/profiles/

Disallow

/README.txt

Disallow

/web.config

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/filter/tips

Disallow

/node/add/

Disallow

/search/

Disallow

/user/register

Disallow

/user/password

Disallow

/user/login

Disallow

/user/logout

Disallow

/media/oembed

Disallow

/*/media/oembed

Disallow

/index.php/admin/

Disallow

/index.php/comment/reply/

Disallow

/index.php/filter/tips

Disallow

/index.php/node/add/

Disallow

/index.php/search/

Disallow

/index.php/user/password

Disallow

/index.php/user/register

Disallow

/index.php/user/login

Disallow

/index.php/user/logout

Disallow

/index.php/media/oembed

Disallow

/index.php/*/media/oembed

Disallow

/api-doc

Disallow

/api-doc-ui

Disallow

/ticket_assistenza

Disallow

/form/trasparenza-titolari-incarichi-politici

Disallow

/form/trasparenza-titolari-incarichi-amministrativi-di-vertice

Disallow

/form/trasparenza-titolari-incarichi-dirigenziali

Disallow

/form/trasparenza-posizioni-organizzative

Disallow

/modulo-iscrizione-evento-pa-digitale-2026

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
CSS, JS, Images
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
API documentation
PORTSTU-241

Back to top

comune.bergamo.itrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Comments

comune.bergamo.it
robots.txt