insolesgeek.com
robots.txt

Robots Exclusion Standard data for insolesgeek.com

Resource Scan

Scan Details

Site Domain insolesgeek.com
Base Domain insolesgeek.com
Scan Status Ok
Last Scan 2025-05-18T15:55:02+00:00
Next Scan 2025-06-17T15:55:02+00:00

Last Scan

Scanned 2025-05-18T15:55:02+00:00
URL https://insolesgeek.com/robots.txt
Domain IPs 104.26.10.46, 104.26.11.46, 172.67.75.62, 2606:4700:20::681a:a2e, 2606:4700:20::681a:b2e, 2606:4700:20::ac43:4b3e
Response IP 104.26.10.46
Found Yes
Hash 8212195a6fd9cde19213969b40b2007d49f6c5dde692479041e059a9cd2a9fb8
SimHash e53d0946f513
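
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not state the algorithm or exactly which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The function names `fingerprint` and `matches_recorded` are hypothetical and only illustrate how a fetched copy could be compared with the recorded value.

```python
import hashlib

# Value copied from the "Hash" field of the scan report.
RECORDED_HASH = "8212195a6fd9cde19213969b40b2007d49f6c5dde692479041e059a9cd2a9fb8"


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt response body.

    Assumption: the scanner hashes the unmodified response bytes with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a freshly fetched body against the recorded digest."""
    return fingerprint(body) == recorded.lower()
```

A mismatch would not by itself prove the file changed; it could also mean the scanner normalizes the body (line endings, trailing whitespace) before hashing.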

Groups

*

Rule Path
Disallow /cache/
Disallow /cgi-bin/
Disallow /logs/
Disallow /cdn-cgi/
Disallow /index.php?main_page=products_all
Disallow /index.php?main_page=featured_products
Disallow /index.php?main_page=products_new
Disallow /index.php?main_page=specials
Disallow /index.php?main_page=testimonials_manager&testimonials_id=
Disallow /index.php?main_page=compare
Disallow /index.php?main_page=testimonials_manager
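
The rules in this group can be checked against sample URLs with Python's standard `urllib.robotparser`. This is a minimal sketch: the `ROBOTS_TXT` text is reconstructed from the table above and the sitemap record, so its exact layout is an assumption, and the sample URLs in the usage lines are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; the real file's ordering, comments
# and blank lines are not known and are assumed here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cache/
Disallow: /cgi-bin/
Disallow: /logs/
Disallow: /cdn-cgi/
Disallow: /index.php?main_page=products_all
Disallow: /index.php?main_page=featured_products
Disallow: /index.php?main_page=products_new
Disallow: /index.php?main_page=specials
Disallow: /index.php?main_page=testimonials_manager&testimonials_id=
Disallow: /index.php?main_page=compare
Disallow: /index.php?main_page=testimonials_manager

Sitemap: https://insolesgeek.com/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` on insolesgeek.com."""
    return parser.can_fetch(agent, "https://insolesgeek.com" + path)


parser = build_parser(ROBOTS_TXT)

# Hypothetical sample paths, chosen only to exercise the rules.
blocked = is_allowed(parser, "/index.php?main_page=specials")
open_page = is_allowed(parser, "/index.php?main_page=product_info")
```

Because `urllib.robotparser` matches rules by prefix, the `testimonials_manager&testimonials_id=` rule is already covered by the shorter `testimonials_manager` rule. Other crawlers may interpret query strings differently.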

Other Records

Field Value
sitemap https://insolesgeek.com/sitemap.xml