glamournaturel.com
robots.txt

Robots Exclusion Standard data for glamournaturel.com

Resource Scan

Scan Details

Site Domain glamournaturel.com
Base Domain glamournaturel.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-17T03:51:29+00:00
Next Scan 2024-12-16T03:51:29+00:00

Last Successful Scan

Scanned2024-03-31T09:50:33+00:00
URL https://glamournaturel.com/robots.txt
Domain IPs 104.21.15.3, 172.67.160.245, 2606:4700:3030::ac43:a0f5, 2606:4700:3035::6815:f03
Response IP 172.67.160.245
Found Yes
Hash c1f3f55d3a3a0aaca7ee23388da111f2c12bf4c19ecc06eac2b44f88e1fd201b
SimHash 7b0fdc206ab1

Groups

ahrefsbot
aspiegelbot
blexbot
barkrowler
dotbot
mj12bot
mauibot
nimbostratus-bot
petalbot
semrushbot
seznambot
sogou
serpstatbot
trendiction
textbulkerbot

Rule Path
Disallow /wp-admin/

Other Records

Field Value
crawl-delay 180

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.glamournaturel.com/post-sitemap.xml
sitemap https://www.glamournaturel.com/sitemap_index.xml
sitemap https://www.glamournaturel.com/sitemap-news.xml

Warnings

  • 1 invalid line.