glacialairsystems.com
robots.txt

Robots Exclusion Standard data for glacialairsystems.com

Resource Scan

Scan Details

Site Domain glacialairsystems.com
Base Domain glacialairsystems.com
Scan Status Ok
Last Scan2025-07-22T19:55:35+00:00
Next Scan 2025-08-21T19:55:35+00:00

Last Scan

Scanned2025-07-22T19:55:35+00:00
URL https://glacialairsystems.com/robots.txt
Domain IPs 2a02:4780:15:19be:d1ab:1d90:774a:8a94, 2a02:4780:16:e0a7:711b:5215:42fd:12b5, 84.32.84.0, 84.32.84.144
Response IP 77.37.115.163
Found Yes
Hash 9100b52c6ea8b54f2ed82c23b308fb9c74d579366fc8c0351b75e56c31fba77e
SimHash b2a033922682

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /
Disallow /?s=
Disallow /search/
Disallow /page/

Other Records

Field Value
sitemap https://glacialairsystems.com/wp-sitemap.xml
sitemap https://glacialairsystems.com/sitemap.xml
sitemap https://glacialairsystems.com/sitemap-news.xml
sitemap https://glacialairsystems.com/sitemap.rss

Comments

  • robots.txt for Glacial Air Systems
  • Prevent crawling of internal WordPress search and pagination URLs
  • Sitemap declarations
  • Link to AI permission file

Warnings

  • `llm-txt` is not a known field.