ligaschile.cl
robots.txt

Robots Exclusion Standard data for ligaschile.cl

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ligaschile.cl
Base Domain	ligaschile.cl
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a server error.
Last Scan	2024-10-30T11:17:56+00:00
Next Scan	2025-01-28T11:17:56+00:00

Last Successful Scan

Scanned	2024-07-03T11:16:13+00:00
URL	https://ligaschile.cl/robots.txt
Domain IPs	217.79.240.213
Response IP	217.79.240.213
Found	Yes
Hash	c2e7abcddc8e4a8e510dfe985717d68c419717fca4d569cb0484ff53e103463c
SimHash	48d45f14c113

Groups

*

Rule	Path
Disallow	/cgi-bin
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/wp-includes/
Disallow	/wp-admin/
Allow	/feed/$
Disallow	/feed
Disallow	/comments/feed
Disallow	/*/feed/$
Disallow	/*/feed/rss/$
Disallow	/*/trackback/$
Disallow	///feed/$
Disallow	///feed/rss/$
Disallow	///trackback/$
Disallow	///*/feed/$
Disallow	///*/feed/rss/$
Disallow	///*/trackback/$
Allow	/*.js$
Allow	/*.css$
Disallow	/*.pdf$

Rule

Path

Disallow

/cgi-bin

Disallow

/wp-content/plugins/

Disallow

/wp-content/themes/

Disallow

/wp-includes/

Disallow

/wp-admin/

Allow

/feed/$

Disallow

/feed

Disallow

/comments/feed

Disallow

/*/feed/$

Disallow

/*/feed/rss/$

Disallow

/*/trackback/$

Disallow

/*/*/feed/$

Disallow

/*/*/feed/rss/$

Disallow

/*/*/trackback/$

Disallow

/*/*/*/feed/$

Disallow

/*/*/*/feed/rss/$

Disallow

/*/*/*/trackback/$

Allow

/*.js$

Allow

/*.css$

Disallow

/*.pdf$

googlebot-image

Rule	Path
Allow	/wp-content/uploads/

Rule

Path

Allow

/wp-content/uploads/

adsbot-google

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

msiecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

microsoft.url.control

Rule	Path
Disallow	/

Rule

Path

Disallow

libwww

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

gurujibot

Rule	Path
Disallow	/

Rule

Path

Disallow

hl_ftien_spider

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou spider

Rule	Path
Disallow	/

Rule

Path

Disallow

yeti

Rule	Path
Disallow	/

Rule

Path

Disallow

yodaobot

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshot

Rule	Path
Disallow

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://ligaschile.cl/news-sitemap.xml
sitemap	https://ligaschile.cl/post-sitemap.xml

Field

Value

sitemap

https://ligaschile.cl/news-sitemap.xml

sitemap

https://ligaschile.cl/post-sitemap.xml

Comments

This virtual robots.txt file was created by the Virtual Robots.txt WordPress plugin: https://www.wordpress.org/plugins/pc-robotstxt/
Bloquear o permitir acceso a contenido adjunto. (Si la instalación está en /public_html).
Impedir el acceso a los diferentes feed que genere la página
Impedir URLs terminadas en /trackback/ que sirven como Trackback URL.
Evita bloqueos de CSS y JS.
Bloquear todos los pdfs
Bloquear parámetros
Lista de bots que deberías permitir.
Lista de bots bloqueados
Desautorizar a páginas innecesarias
Disallow: /gracias-por-suscribirte
Añadimos una indicación de la localización del sitemap
Permitir que Oracle Data Cloud Crawler rastree el sitio

Warnings

1 invalid line.

ligaschile.clrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

googlebot-image

adsbot-google

googlebot-mobile

msiecrawler

webcopier

httrack

microsoft.url.control

libwww

baiduspider

gurujibot

hl_ftien_spider

sogou spider

yeti

yodaobot

grapeshot

Other Records

Comments

Warnings

ligaschile.cl
robots.txt