ae.hm.com
robots.txt

Robots Exclusion Standard data for ae.hm.com

Resource Scan

Scan Details

Site Domain	ae.hm.com
Base Domain	hm.com
Scan Status	Ok
Last Scan	2026-01-13T20:53:32+00:00
Next Scan	2026-01-27T20:53:32+00:00
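
The two timestamps above are exactly 14 days apart. A minimal sketch of that arithmetic follows; the fixed 14-day interval is inferred from this one record, and the name `SCAN_INTERVAL` is hypothetical rather than a documented setting of the scanner.

```python
from datetime import datetime, timedelta

# Assumption: the rescan interval is a fixed 14 days, inferred from this single record.
SCAN_INTERVAL = timedelta(days=14)

def next_scan(last_scan_iso):
    """Return the ISO 8601 timestamp one scan interval after the given one."""
    return (datetime.fromisoformat(last_scan_iso) + SCAN_INTERVAL).isoformat()

print(next_scan("2026-01-13T20:53:32+00:00"))
```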

Last Scan

Scanned	2026-01-13T20:53:32+00:00
URL	https://ae.hm.com/robots.txt
Domain IPs	151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP	146.75.45.124
Found	Yes
Hash	8285027aac9105065d1a48cdeaace6ea1b1dfb18936ae2f963a99d85abb69ed3
SimHash	b8149d0bc365

Groups

*

Rule	Path
Disallow	/*?
Disallow	*/--*
Disallow	*/?*
Disallow	*/system/404?referer*
Disallow	/*?q=
Allow	/
Allow	/*media_*?
Allow	/*.json?
Allow	/*?selected*
Allow	/*?page=
Disallow	*/fragments/
Disallow	*/tools/
Disallow	*/cart/
Disallow	*/user/
Disallow	*/footer
Disallow	*/header
Disallow	*/checkout
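
Because the `*` group mixes Allow and Disallow rules with wildcards, the outcome for a given path depends on how a crawler resolves overlapping matches. The sketch below evaluates the rules listed for the `*` group under the precedence described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. It is an illustration under that assumption, not a description of how any particular crawler behaves; the helper names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical. Matching is hand-rolled because Python's standard `urllib.robotparser` compares plain path prefixes and does not expand `*`.

```python
import re

# Rules from the "*" group of this record, in file order.
RULES = [
    ("disallow", "/*?"),
    ("disallow", "*/--*"),
    ("disallow", "*/?*"),
    ("disallow", "*/system/404?referer*"),
    ("disallow", "/*?q="),
    ("allow", "/"),
    ("allow", "/*media_*?"),
    ("allow", "/*.json?"),
    ("allow", "/*?selected*"),
    ("allow", "/*?page="),
    ("disallow", "*/fragments/"),
    ("disallow", "*/tools/"),
    ("disallow", "*/cart/"),
    ("disallow", "*/user/"),
    ("disallow", "*/footer"),
    ("disallow", "*/header"),
    ("disallow", "*/checkout"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt pattern: '*' matches any run, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict

# Hypothetical sample paths, chosen only to exercise the rules.
for sample in ["/en_ae/ladies.html", "/en_ae/ladies.html?page=2",
               "/en_ae/ladies.html?q=dress", "/en_ae/cart/"]:
    print(sample, is_allowed(sample))
```

Under this reading, `/*?page=` (8 characters) outranks the blanket `/*?` (3 characters), so paginated listing URLs stay crawlable while other query strings are blocked.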


Other Records

Field	Value
sitemap	https://ae.hm.com/sitemap.xml


Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
Files
XML sitemap
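
The comment above notes that robots.txt is only honoured at the root of the host. A minimal sketch of deriving that location from any page URL, using only the standard library; the function name `robots_url` is hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url):
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; drop the page's own path, query, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))

print(robots_url("http://example.com/site/page.html"))
```

Applied to a page on this host, such as a hypothetical `https://ae.hm.com/en_ae/ladies.html?page=2`, the function yields the scanned URL `https://ae.hm.com/robots.txt`.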
