Robots Exclusion Standard data for instagrizli.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | instagrizli.com |
Base Domain | instagrizli.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-10-16T19:18:29+00:00 |
Next Scan | 2025-01-14T19:18:29+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-09-23T09:40:28+00:00 |
URL | https://instagrizli.com/robots.txt |
Domain IPs | 87.236.16.235 |
Response IP | 87.236.16.235 |
Found | Yes |
Hash | 4e269027199c1270d956f88972ecc90085c4b9cb704171133d5d0f60bcff61b4 |
SimHash | 8b10ddb027ee |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /wp-admin |
Disallow | /wp-includes |
Disallow | /wp-content/ |
Allow | /wp-content/themes/instagrizli/dist/styles/ |
Allow | /wp-content/themes/instagrizli/dist/js/ |
Allow | /wp-content/themes/instagrizli/images |
Disallow | /trackback |
Disallow | */trackback |
Disallow | */*/trackback |
Disallow | /author/ |
Disallow | /wp-login.php |
Disallow | */attachment/ |
Disallow | /search |
Disallow | */comment/ |
Disallow | /category/ |
Disallow | */print/ |
Disallow | /wp-json/ |
Disallow | */feed |
Disallow | /tag/ |
Disallow | /2019/ |
Disallow | *? |
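
To illustrate how a crawler might evaluate the rules in this group, the following is a minimal sketch in Python. It assumes the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie) and treats `*` as a wildcard for any sequence of characters. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers written for this example; they are not part of the scan data or of any library. Several paths in the table begin with `*` rather than `/`, and real crawlers may handle those differently than this sketch does.

```python
import re

# Rules copied from the "*" group above, in their original order.
RULES = [
    ("Disallow", "/cgi-bin"),
    ("Disallow", "/wp-admin"),
    ("Disallow", "/wp-includes"),
    ("Disallow", "/wp-content/"),
    ("Allow", "/wp-content/themes/instagrizli/dist/styles/"),
    ("Allow", "/wp-content/themes/instagrizli/dist/js/"),
    ("Allow", "/wp-content/themes/instagrizli/images"),
    ("Disallow", "/trackback"),
    ("Disallow", "*/trackback"),
    ("Disallow", "*/*/trackback"),
    ("Disallow", "/author/"),
    ("Disallow", "/wp-login.php"),
    ("Disallow", "*/attachment/"),
    ("Disallow", "/search"),
    ("Disallow", "*/comment/"),
    ("Disallow", "/category/"),
    ("Disallow", "*/print/"),
    ("Disallow", "/wp-json/"),
    ("Disallow", "*/feed"),
    ("Disallow", "/tag/"),
    ("Disallow", "/2019/"),
    ("Disallow", "*?"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the URL path.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be fetched under the given rules.

    Assumption: the longest matching pattern wins, Allow wins a tie,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for rule, path in rules:
        if pattern_to_regex(path).match(url_path):
            is_allow = rule == "Allow"
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len = len(path)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for p in ["/about/", "/wp-content/uploads/photo.png",
              "/wp-content/themes/instagrizli/dist/js/app.js",
              "/about/?page=2", "/blog/post/feed"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, the three Allow lines carve the theme's styles, scripts, and images out of the broader `/wp-content/` block because their patterns are longer, while `*?` blocks any URL containing a query string unless a longer Allow pattern also matches.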
Other Records
Field | Value |
---|---|
sitemap | https://instagrizli.com/sitemap.xml |