tupalo.co
robots.txt

Robots Exclusion Standard data for tupalo.co

Resource Scan

Scan Details

Site Domain tupalo.co
Base Domain tupalo.co
Scan Status Ok
Last Scan 2024-10-14T01:22:38+00:00
Next Scan 2024-10-21T01:22:38+00:00

Last Scan

Scanned 2024-10-14T01:22:38+00:00
URL https://www.tupalo.co/robots.txt
Domain IPs 162.55.65.173
Response IP 162.55.65.173
Found Yes
Hash 8bdfcaa7169d9577e82dbdb94535fa8b06f073bf288d9d13479233c4bcca276a
SimHash 719c091bc6d4
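
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scanner's algorithm is not stated here. Assuming SHA-256 over the raw response body, a change check between scans could be sketched as follows. The helper `body_digest` and the sample bytes are illustrative only.

```python
import hashlib

def body_digest(body: bytes) -> str:
    """Hex SHA-256 digest of a fetched robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

# Hash reported by the scan above.
REPORTED = "8bdfcaa7169d9577e82dbdb94535fa8b06f073bf288d9d13479233c4bcca276a"

# Placeholder bytes, not the real file; a real check would hash the response
# body fetched from the URL listed above.
sample = b"User-agent: *\nDisallow: /widgets/\n"
digest = body_digest(sample)

# Same length as the reported hash, so the two values are comparable.
print(len(digest) == len(REPORTED))
print("unchanged" if digest == REPORTED else "changed")
```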

Groups

*

Rule Path
Disallow /widgets/
Disallow /mobile/
Disallow /s/
Disallow /s/*/togo
Disallow /s/*/been
Disallow /s/*/favorite
Disallow /s/*/reviews
Disallow /*/*/edit$
Disallow /*/*/business$
Disallow /s/*/lazy_load_pics
Disallow /*/s/*/near$
Disallow /*/widget/
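
Several of the paths above use two special characters: `*` matches any sequence of characters, and a trailing `$` anchors the pattern to the end of the URL path (the behaviour described in RFC 9309). A minimal Python sketch of that matching follows. The helper names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical, and only Disallow rules are handled because this group lists no Allow rules.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/widgets/", "/mobile/", "/s/", "/s/*/togo", "/s/*/been",
    "/s/*/favorite", "/s/*/reviews", "/*/*/edit$", "/*/*/business$",
    "/s/*/lazy_load_pics", "/*/s/*/near$", "/*/widget/",
]

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

RULES = [rule_to_regex(p) for p in DISALLOW]

def is_disallowed(path: str) -> bool:
    """True if any Disallow pattern matches the path from its first character."""
    return any(rule.match(path) for rule in RULES)

# Hypothetical paths, chosen only to exercise the patterns.
for path in ["/s/anything", "/bogota/some-spot/edit",
             "/bogota/some-spot/edit/history", "/bogota/some-spot"]:
    print(path, is_disallowed(path))
```

Without the `$`, a pattern acts as a prefix match, which is why `/s/` blocks everything under that directory while `/*/*/edit$` blocks only paths that end in `/edit`.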

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

applebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
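
A crawler that honours these records would wait 10 seconds between requests when it identifies as bingbot or applebot. The sketch below reads the delay with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a hand-written reconstruction of the two groups reported above, not the site's actual file, and `polite_delay` is a hypothetical helper.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction (assumption) of the two crawl-delay groups listed above.
ROBOTS_TXT = """\
User-agent: bingbot
Crawl-delay: 10

User-agent: applebot
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

def polite_delay(user_agent: str, default: float = 0.0) -> float:
    """Seconds to wait between requests for this user agent."""
    delay = parser.crawl_delay(user_agent)
    return float(delay) if delay is not None else default

print(polite_delay("bingbot"))
print(polite_delay("examplebot"))  # hypothetical agent with no matching group
```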

amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
facebookexternalhit
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
icc-crawler
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
youbot
the knowledge ai

Rule Path
Disallow /
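
In this group a single `Disallow /` applies to every listed user agent, so none of those crawlers may fetch any path. The sketch below shows that grouping with Python's standard `urllib.robotparser`. The agent list is abbreviated to four of the names above, the reconstructed lines are an assumption about how the file expresses the group, and `examplebot` is a hypothetical agent that is not in it.

```python
from urllib.robotparser import RobotFileParser

# A few of the user agents from the group above (abbreviated on purpose).
AGENTS = ["gptbot", "claudebot", "ccbot", "perplexitybot"]

# Reconstruction (assumption): consecutive User-agent lines share the rules
# that follow them, so one "Disallow: /" covers the whole group.
lines = [f"User-agent: {name}" for name in AGENTS] + ["Disallow: /"]

parser = RobotFileParser()
parser.parse(lines)

for agent in ["gptbot", "ccbot", "examplebot"]:
    allowed = parser.can_fetch(agent, "https://www.tupalo.co/")
    print(agent, "allowed" if allowed else "blocked")
```

This sketch leaves out the `*` group, so it only answers whether an agent belongs to the fully blocked group, not whether a given path is otherwise permitted.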

Other Records

Field Value
sitemap https://sitemaps.tupalo.com/co_spots_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/co_categories_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/co_tags_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/co_streets_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/co_chains_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/co_medias_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/co_cities_sitemap_index_00.xml
sitemap https://www.tupalo.co/updates.xml

Comments

  • Hey robot! Have a good time at Tupalo.com!

Warnings

  • `noindex` is not a known field.