theglamandglitter.com
robots.txt

Robots Exclusion Standard data for theglamandglitter.com

Resource Scan

Scan Details

Site Domain theglamandglitter.com
Base Domain theglamandglitter.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan4/13/2025, 12:31:56 PM
Next Scan 7/12/2025, 12:31:56 PM

Last Successful Scan

Scanned1/8/2023, 5:40:24 AM
URL https://theglamandglitter.com/robots.txt
Domain IPs 104.21.37.226, 172.67.214.89, 2606:4700:3031::6815:25e2, 2606:4700:3035::ac43:d659
Response IP 104.21.37.226
Found Yes
Hash 62c2312400d2099fb1626edf79900d22bd419aa9358cf848acf4572a049486a5
SimHash 201cdd02e692

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

irlbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

ezooms robot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

perl lwp

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://theglamandglitter.com/sitemap_index.xml