tupalo.co
robots.txt
Robots Exclusion Standard data for tupalo.co
Resource Scan
Scan Details
Site Domain | tupalo.co |
Base Domain | tupalo.co |
Scan Status | Ok |
Last Scan | 2024-10-14T01:22:38+00:00 |
Next Scan | 2024-10-21T01:22:38+00:00 |
Last Scan
Scanned | 2024-10-14T01:22:38+00:00 |
URL | https://www.tupalo.co/robots.txt |
Domain IPs | 162.55.65.173 |
Response IP | 162.55.65.173 |
Found | Yes |
Hash | 8bdfcaa7169d9577e82dbdb94535fa8b06f073bf288d9d13479233c4bcca276a |
SimHash | 719c091bc6d4 |
Groups
*
Rule | Path |
---|---|
Disallow | /widgets/ |
Disallow | /mobile/ |
Disallow | /s/ |
Disallow | /s/*/togo |
Disallow | /s/*/been |
Disallow | /s/*/favorite |
Disallow | /s/*/reviews |
Disallow | /*/*/edit$ |
Disallow | /*/*/business$ |
Disallow | /s/*/lazy_load_pics |
Disallow | /*/s/*/near$ |
Disallow | /*/widget/ |
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
facebookexternalhit
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
icc-crawler
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
youbot
the knowledge ai
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://sitemaps.tupalo.com/co_spots_sitemap_index_00.xml |
sitemap | https://sitemaps.tupalo.com/co_categories_sitemap_index_00.xml |
sitemap | https://sitemaps.tupalo.com/co_tags_sitemap_index_00.xml |
sitemap | https://sitemaps.tupalo.com/co_streets_sitemap_index_00.xml |
sitemap | https://sitemaps.tupalo.com/co_chains_sitemap_index_00.xml |
sitemap | https://sitemaps.tupalo.com/co_medias_sitemap_index_00.xml |
sitemap | https://sitemaps.tupalo.com/co_cities_sitemap_index_00.xml |
sitemap | https://www.tupalo.co/updates.xml |
Warnings
- `noindex` is not a known field.
Comments