iclnoticias.com
robots.txt

Robots Exclusion Standard data for iclnoticias.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	iclnoticias.com
Base Domain	iclnoticias.com
Scan Status	Ok
Last Scan	2024-10-27T23:47:02+00:00
Next Scan	2024-11-03T23:47:02+00:00

Last Scan

Scanned	2024-10-27T23:47:02+00:00
URL	https://iclnoticias.com/robots.txt
Domain IPs	104.21.50.84, 172.67.203.162, 2606:4700:3036::6815:3254, 2606:4700:3037::ac43:cba2
Response IP	104.21.50.84
Found	Yes
Hash	d98d076a371473a0674539bade859c200b9d6f17aa2ae74dcf21ef5770b5728f
SimHash	69bc70746eaa

Groups

*

Rule	Path
Disallow	/newland/
Disallow	/kle/

Rule

Path

Disallow

/newland/

Disallow

/kle/

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-includes/
Disallow	/cgi-bin/
Disallow	/tmp/
Disallow	/private/
Disallow	/search
Disallow	/tag/
Disallow	/author/
Disallow	/archive/
Disallow	/category/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-admin/

Disallow

/wp-includes/

Disallow

/cgi-bin/

Disallow

/tmp/

Disallow

/private/

Disallow

/search

Disallow

/tag/

Disallow

/author/

Disallow

/archive/

Disallow

/category/

Allow

/wp-admin/admin-ajax.php

googlebot

Rule	Path
Disallow	/private/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/private/

Allow

/wp-admin/admin-ajax.php

bingbot

Rule	Path
Disallow	/private/
Allow	/wp-admin/admin-ajax.php
Disallow	/wp-content/plugins/
Disallow	/wp-content/cache/
Disallow	/wp-content/themes/
Disallow	/*.jpg$
Disallow	/*.jpeg$
Disallow	/*.png$
Disallow	/*.gif$
Disallow	/*.bmp$
Disallow	/*.tiff$
Disallow	/*.ico$
Disallow	/?replytocom
Disallow	/?utm_source
Disallow	/?utm_medium
Disallow	/?utm_campaign

Rule

Path

Disallow

/private/

Allow

/wp-admin/admin-ajax.php

Disallow

/wp-content/plugins/

Disallow

/wp-content/cache/

Disallow

/wp-content/themes/

Disallow

/*.jpg$

Disallow

/*.jpeg$

Disallow

/*.png$

Disallow

/*.gif$

Disallow

/*.bmp$

Disallow

/*.tiff$

Disallow

/*.ico$

Disallow

/*?*replytocom

Disallow

/*?*utm_source

Disallow

/*?*utm_medium

Disallow

/*?*utm_campaign

bingbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

Back to top

Other Records

Field	Value
sitemap	https://iclnoticias.com/sitemap_index.xmll

Field

Value

sitemap

https://iclnoticias.com/sitemap_index.xmll

Back to top

Comments

Googlebot
Bingbot
Specific directories
Media files
Blocking query parameters
Crawl-delay for Bingbot

Back to top

iclnoticias.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

googlebot

bingbot

bingbot

Other Records

Other Records

Comments

iclnoticias.com
robots.txt