imerys.com
robots.txt

Robots Exclusion Standard data for imerys.com

Resource Scan

Scan Details

Site Domain	imerys.com
Base Domain	imerys.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-11-01T16:37:03+00:00
Next Scan	2025-01-30T16:37:03+00:00

Last Successful Scan

Scanned	2024-06-12T16:07:46+00:00
URL	https://imerys.com/robots.txt
Domain IPs	13.249.39.23, 13.249.39.41, 13.249.39.48, 13.249.39.70
Response IP	13.249.39.23
Found	Yes
Hash	56d89a949db56b2def07cb9d71893b1229af19bbb9098dd00eca46fd06090a4c
SimHash	b916bd534760

Groups

*

Rule	Path
Disallow	/core/
Disallow	/profiles/
Disallow	/README.txt
Disallow	/web.config
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register
Disallow	/user/password
Disallow	/user/login
Disallow	/user/logout
Disallow	/index.php/admin/
Disallow	/index.php/comment/reply/
Disallow	/index.php/filter/tips
Disallow	/index.php/node/add/
Disallow	/index.php/search/
Disallow	/index.php/user/password
Disallow	/index.php/user/register
Disallow	/index.php/user/login
Disallow	/index.php/user/logout
Disallow	/search?
Disallow	/search*
Disallow	/*?
Disallow	/admin
Disallow	/comment/reply
Disallow	/node/add
Disallow	/search
Disallow	/?q=admin
Disallow	/?q=comment%2Freply
Disallow	/?q=contact
Disallow	/?q=logout
Disallow	/?q=node%2Fadd
Disallow	/?q=search
Disallow	/?q=user%2Fpassword
Disallow	/?q=user%2Fregister
Disallow	/?q=user%2Flogin
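
The rules above mix plain path prefixes with wildcard patterns such as `/search*` and `/*?`. As a rough illustration of how a crawler might evaluate them, here is a minimal Python sketch. It assumes the common convention in which `*` matches any run of characters and a trailing `$` anchors the end of the path. The report itself does not say which matching semantics apply, and the helper names `pattern_to_regex` and `is_disallowed` are hypothetical. Only a subset of the listed rules is included.

```python
import re

# A subset of the Disallow paths from the table above (not the full list).
DISALLOW = ["/core/", "/admin/", "/user/login", "/search*", "/*?", "/?q=admin"]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex matched from the path start.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the start of `path`."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for candidate in ["/admin/config", "/en/products", "/en/products?page=2"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/*?` blocks every URL that carries a query string, which makes the more specific `/?q=...` entries redundant for crawlers that honor wildcards.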

Other Records

Field	Value
crawl-delay	10
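
A crawler that honors this record would wait 10 seconds between requests. The sketch below reads the value with Python's standard `urllib.robotparser` and spaces out request times accordingly. The robots.txt lines are an assumed reconstruction: the report does not show which group the `crawl-delay` record belongs to, so it is placed under `User-agent: *` here. The agent name `ExampleBot` and the helper `fetch_times` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the relevant lines; the group placement of
# Crawl-delay is a guess, since the report lists it only under "Other Records".
ROBOTS_LINES = [
    "User-agent: *",
    "Crawl-delay: 10",
    "Disallow: /admin/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds to wait between requests for this agent, or None if unset.
delay = parser.crawl_delay("ExampleBot")


def fetch_times(n_requests: int, delay_seconds: float, start: float = 0.0) -> list:
    """Return the earliest allowed time offset (in seconds) for each request."""
    return [start + i * delay_seconds for i in range(n_requests)]


if __name__ == "__main__":
    print(delay)
    print(fetch_times(3, delay))
```

Computing offsets instead of calling `time.sleep` directly keeps the sketch easy to test; a real crawler would sleep until each offset has elapsed.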


Other Records

Field	Value
sitemap	https://www.imerys.com/sitemap.xml


Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
CSS, JS, Images
Allow: /core/*.css$
Allow: /core/*.css?
Allow: /core/*.js$
Allow: /core/*.js?
Allow: /core/*.gif
Allow: /core/*.jpg
Allow: /core/*.jpeg
Allow: /core/*.png
Allow: /core/*.svg
Allow: /profiles/*.css$
Allow: /profiles/*.css?
Allow: /profiles/*.js$
Allow: /profiles/*.js?
Allow: /profiles/*.gif
Allow: /profiles/*.jpg
Allow: /profiles/*.jpeg
Allow: /profiles/*.png
Allow: /profiles/*.svg
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
Paths (clean URLs) – fixed!
Paths (no clean URLs) – fixed!
