thegroovecartel.com
robots.txt

Robots Exclusion Standard data for thegroovecartel.com

Resource Scan

Scan Details

Site Domain thegroovecartel.com
Base Domain thegroovecartel.com
Scan Status Ok
Last Scan2024-11-15T07:09:43+00:00
Next Scan 2024-11-22T07:09:43+00:00

Last Scan

Scanned2024-11-15T07:09:43+00:00
URL https://thegroovecartel.com/robots.txt
Domain IPs 104.21.14.85, 172.67.202.174, 2606:4700:3031::6815:e55, 2606:4700:3035::ac43:caae
Response IP 104.21.14.85
Found Yes
Hash 7837377959140ff05c2aab7a433977546fe94585d802dcf2e68f35d94ffa412b
SimHash 4805d8060717

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /feed

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

archiveteam.org

Rule Path
Disallow /

surveybot_ignoreip

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thegroovecartel.com/news-sitemap.xml
sitemap https://thegroovecartel.com/post-sitemap.xml