upperinc.com
robots.txt

Robots Exclusion Standard data for upperinc.com

Resource Scan

Scan Details

Site Domain upperinc.com
Base Domain upperinc.com
Scan Status Ok
Last Scan2025-12-25T18:28:40+00:00
Next Scan 2026-01-24T18:28:40+00:00

Last Scan

Scanned2025-12-25T18:28:40+00:00
URL https://upperinc.com/robots.txt
Domain IPs 104.26.10.182, 104.26.11.182, 172.67.73.168, 2606:4700:20::681a:ab6, 2606:4700:20::681a:bb6, 2606:4700:20::ac43:49a8
Response IP 104.26.10.182
Found Yes
Hash e88d036bcb0737ca09d3a35ff8f2642661056c9425943efc2c1a5351a7b48deb
SimHash 50b07cd3def3

Groups

*

Rule Path
Allow /

oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot
cloudbot
gemini

Rule Path
Allow /
Disallow /wp-admin/
Disallow /internal/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /cdn-cgi/*
Disallow /cgi-bin
Allow /wp-admin/admin-ajax.php
Disallow /*.pdf
Disallow /lp/
Disallow /previews/*

Other Records

Field Value
sitemap https://www.upperinc.com/sitemap_index.xml

Comments

  • Allow traditional search indexing
  • Allow AI search and agent use
  • Disallow access to admin areas for all bots
  • Allow admin-ajax.php
  • Specific file exclusions
  • Landing page exclusions
  • Preview exclusions
  • Sitemap locations

Warnings

  • `noindex` is not a known field.