glam.com
robots.txt
Robots Exclusion Standard data for glam.com
Resource Scan
Scan Details
Site Domain | glam.com |
Base Domain | glam.com |
Scan Status | Ok |
Last Scan | 2024-11-01T09:01:31+00:00 |
Next Scan | 2024-11-08T09:01:31+00:00 |
Last Scan
Scanned | 2024-11-01T09:01:31+00:00 |
URL | https://glam.com/robots.txt |
Domain IPs | 3.165.82.100, 3.165.82.102, 3.165.82.3, 3.165.82.43 |
Response IP | 3.165.82.100 |
Found | Yes |
Hash | 2a9b383093929910b77245c5483508f9ff8f492ce704d4854f4483697ed9ba1c |
SimHash | e11448484e93 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /*?*ajax= |
Disallow | /*/s/* |
Disallow | /*/sl/* |
Disallow | /search/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.glam.com/sitemap_index.xml |
sitemap | https://www.glam.com/stories/sitemap-index.xml |
sitemap | https://www.glam.com/?getfeed=google |