latitude.sh
robots.txt

Robots Exclusion Standard data for latitude.sh

Resource Scan

Scan Details

Site Domain latitude.sh
Base Domain latitude.sh
Scan Status Ok
Last Scan2024-08-29T09:50:29+00:00
Next Scan 2024-09-28T09:50:29+00:00

Last Scan

Scanned2024-08-29T09:50:29+00:00
URL https://latitude.sh/robots.txt
Redirect https://www.latitude.sh/robots.txt
Redirect Domain www.latitude.sh
Redirect Base latitude.sh
Domain IPs 76.76.21.21
Redirect IPs 76.76.21.164, 76.76.21.9
Response IP 76.76.21.164
Found Yes
Hash ffeeb940548c70637d5536abf6c6d520785b8b685227c23592d70010f4c85a42
SimHash 69e88e99e520

Groups

*

Rule Path
Allow /api/og
Allow /dashboard$
Allow /dashboard/signup$
Disallow /dashboard/
Disallow /api/
Disallow /ip-geo-location.csv
Allow /

twitterbot

Rule Path
Allow /api/og

Other Records

Field Value
sitemap https://www.latitude.sh/sitemap.xml

Comments

  • Allow dynamic open graph images
  • Allow auth pages
  • Allow all other pages not disallowed above
  • Host
  • Sitemaps
  • Twitterbot doesn't seem to follow Googles robot.txt spec
  • This rule has been added to allow twitterbot to crawl images for sharing on twitter

Warnings

  • `host` is not a known field.