instagrizli.com
robots.txt

Robots Exclusion Standard data for instagrizli.com

Resource Scan

Scan Details

Site Domain instagrizli.com
Base Domain instagrizli.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-10-16T19:18:29+00:00
Next Scan 2025-01-14T19:18:29+00:00

Last Successful Scan

Scanned 2023-09-23T09:40:28+00:00
URL https://instagrizli.com/robots.txt
Domain IPs 87.236.16.235
Response IP 87.236.16.235
Found Yes
Hash 4e269027199c1270d956f88972ecc90085c4b9cb704171133d5d0f60bcff61b4
SimHash 8b10ddb027ee

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/
Allow /wp-content/themes/instagrizli/dist/styles/
Allow /wp-content/themes/instagrizli/dist/js/
Allow /wp-content/themes/instagrizli/images
Disallow /trackback
Disallow */trackback
Disallow */*/trackback
Disallow /author/
Disallow /wp-login.php
Disallow */attachment/
Disallow /search
Disallow */comment/
Disallow /category/
Disallow */print/
Disallow /wp-json/
Disallow */feed
Disallow /tag/
Disallow /2019/
Disallow *?
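
The overlapping Allow and Disallow entries in the group above are easier to follow with a small evaluation sketch. The Python below is only an illustration and assumes the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie); the report does not say which crawler or matching rules apply, so a given crawler may behave differently. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers, not part of any scanner or library.

```python
import re

# Rules copied from the "*" group above, in listed order.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/"),
    ("allow", "/wp-content/themes/instagrizli/dist/styles/"),
    ("allow", "/wp-content/themes/instagrizli/dist/js/"),
    ("allow", "/wp-content/themes/instagrizli/images"),
    ("disallow", "/trackback"),
    ("disallow", "*/trackback"),
    ("disallow", "*/*/trackback"),
    ("disallow", "/author/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "*/attachment/"),
    ("disallow", "/search"),
    ("disallow", "*/comment/"),
    ("disallow", "/category/"),
    ("disallow", "*/print/"),
    ("disallow", "/wp-json/"),
    ("disallow", "*/feed"),
    ("disallow", "/tag/"),
    ("disallow", "/2019/"),
    ("disallow", "*?"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed precedence.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for sample in (
        "/wp-content/uploads/photo.png",
        "/wp-content/themes/instagrizli/dist/js/app.js",
        "/about/",
        "/blog/post/?replytocom=5",
    ):
        print(sample, is_allowed(sample))
```

Under these assumptions the theme's script and style directories stay crawlable even though /wp-content/ as a whole is disallowed, because the Allow patterns are longer than the Disallow pattern. Any URL containing a query string is blocked by the final *? rule.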

googlebot-image

Rule Path
Disallow /wp-content/uploads/

yandeximages

Rule Path
Disallow /wp-content/uploads/

Other Records

Field Value
sitemap https://instagrizli.com/sitemap.xml