tupalo.com
robots.txt

Robots Exclusion Standard data for tupalo.com

Resource Scan

Scan Details

Site Domain tupalo.com
Base Domain tupalo.com
Scan Status Ok
Last Scan 2024-09-22T09:45:55+00:00
Next Scan 2024-09-29T09:45:55+00:00

Last Scan

Scanned 2024-09-22T09:45:55+00:00
URL https://tupalo.com/robots.txt
Domain IPs 162.55.65.173
Response IP 162.55.65.173
Found Yes
Hash 6ce21aa916ddb6e5353eb455807bee6c480d81c17e63fbdac6d1916a8766a845
SimHash 781e4b51a3d1
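
The report does not name the algorithm behind the Hash field. Its 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name `robots_fingerprint` and the sample body are hypothetical, and the SimHash value is not reproduced here.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real tupalo.com file.
sample = b"User-agent: *\nDisallow: /widgets/\n"
digest = robots_fingerprint(sample)

print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing the digest from one scan with the digest from the next is one cheap way to tell whether the file changed at all between scans.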

Groups

*

Rule Path
Disallow /widgets/
Disallow /mobile/
Disallow /en/s/
Disallow /en/s/*/togo
Disallow /en/s/*/been
Disallow /en/s/*/favorite
Disallow /en/s/*/reviews
Disallow /en/*/*/edit$
Disallow /en/*/*/business$
Disallow /en/s/*/lazy_load_pics
Disallow /en/*/s/*/near$
Disallow /en/review_widget
Disallow /en/my_business$
Disallow /en/widget/
Disallow /de/s/
Disallow /de/s/*/togo
Disallow /de/s/*/been
Disallow /de/s/*/favorite
Disallow /de/s/*/reviews
Disallow /de/*/*/edit$
Disallow /de/*/*/business$
Disallow /de/s/*/lazy_load_pics
Disallow /de/*/s/*/near$
Disallow /de/review_widget
Disallow /de/my_business$
Disallow /de/widget/
Disallow /nl/s/
Disallow /nl/s/*/togo
Disallow /nl/s/*/been
Disallow /nl/s/*/favorite
Disallow /nl/s/*/reviews
Disallow /nl/*/*/edit$
Disallow /nl/*/*/business$
Disallow /nl/s/*/lazy_load_pics
Disallow /nl/*/s/*/near$
Disallow /nl/review_widget
Disallow /nl/my_business$
Disallow /nl/widget/
Disallow /fi/s/
Disallow /fi/s/*/togo
Disallow /fi/s/*/been
Disallow /fi/s/*/favorite
Disallow /fi/s/*/reviews
Disallow /fi/*/*/edit$
Disallow /fi/*/*/business$
Disallow /fi/s/*/lazy_load_pics
Disallow /fi/*/s/*/near$
Disallow /fi/review_widget
Disallow /fi/my_business$
Disallow /fi/widget/
Disallow /pl/s/
Disallow /pl/s/*/togo
Disallow /pl/s/*/been
Disallow /pl/s/*/favorite
Disallow /pl/s/*/reviews
Disallow /pl/*/*/edit$
Disallow /pl/*/*/business$
Disallow /pl/s/*/lazy_load_pics
Disallow /pl/*/s/*/near$
Disallow /pl/review_widget
Disallow /pl/my_business$
Disallow /pl/widget/
Disallow /da/s/
Disallow /da/s/*/togo
Disallow /da/s/*/been
Disallow /da/s/*/favorite
Disallow /da/s/*/reviews
Disallow /da/*/*/edit$
Disallow /da/*/*/business$
Disallow /da/s/*/lazy_load_pics
Disallow /da/*/s/*/near$
Disallow /da/review_widget
Disallow /da/my_business$
Disallow /da/widget/
Disallow /sv/s/
Disallow /sv/s/*/togo
Disallow /sv/s/*/been
Disallow /sv/s/*/favorite
Disallow /sv/s/*/reviews
Disallow /sv/*/*/edit$
Disallow /sv/*/*/business$
Disallow /sv/s/*/lazy_load_pics
Disallow /sv/*/s/*/near$
Disallow /sv/review_widget
Disallow /sv/my_business$
Disallow /sv/widget/
Disallow /fr/s/
Disallow /fr/s/*/togo
Disallow /fr/s/*/been
Disallow /fr/s/*/favorite
Disallow /fr/s/*/reviews
Disallow /fr/*/*/edit$
Disallow /fr/*/*/business$
Disallow /fr/s/*/lazy_load_pics
Disallow /fr/*/s/*/near$
Disallow /fr/review_widget
Disallow /fr/my_business$
Disallow /fr/widget/
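
Several of the paths above use `*` (any run of characters) and a trailing `$` (end of path). A minimal sketch of how such patterns could be tested against a URL path follows. It assumes the common wildcard semantics, handles only Disallow rules (no Allow precedence), and uses hypothetical helper names and example paths such as `/en/vienna/cafe-x/edit`.

```python
import re


def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the path start."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape literal pieces, then join them with '.*' where each '*' stood.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, disallow_patterns: list[str]) -> bool:
    """True if any Disallow pattern matches the path."""
    return any(rule_to_regex(p).match(path) for p in disallow_patterns)


# A few patterns copied from the group above.
rules = ["/widgets/", "/en/s/", "/en/*/*/edit$", "/en/my_business$"]

print(is_disallowed("/en/s/vienna/togo", rules))              # True: prefix /en/s/
print(is_disallowed("/en/vienna/cafe-x/edit", rules))         # True: ends in /edit
print(is_disallowed("/en/vienna/cafe-x/edit/photos", rules))  # False: '$' stops the match
print(is_disallowed("/en/my_business/claim", rules))          # False: '$' stops the match
```

Under these semantics the plain prefix `/en/s/` already covers the longer `/en/s/*/togo`-style entries, so those lines add no extra restriction for a matcher like this one.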

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

applebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
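
The bingbot and applebot groups carry only a crawl-delay record. As a rough illustration, Python's standard `urllib.robotparser` can read that value. The fragment below is an assumed reconstruction of the two groups, not the verbatim file.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the two groups; the real file may be laid out differently.
ROBOTS_FRAGMENT = """\
User-agent: bingbot
Crawl-delay: 10

User-agent: applebot
Crawl-delay: 10
"""

delay_parser = RobotFileParser()
delay_parser.parse(ROBOTS_FRAGMENT.splitlines())

bing_delay = delay_parser.crawl_delay("bingbot")
bing_allowed = delay_parser.can_fetch("bingbot", "https://tupalo.com/en/s/")

print(bing_delay)    # 10
print(bing_allowed)  # True: the group defines no Disallow rules
```

A crawler honoring the record would wait that many seconds between requests. Not every crawler implements crawl-delay, so the value is a request to the crawler, not a guarantee.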

amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
facebookexternalhit
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
icc-crawler
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
youbot
the knowledge ai

Rule Path
Disallow /
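
The group above shares a single `Disallow /` rule across every listed agent. A short sketch of what that means for a compliant crawler, again using `urllib.robotparser` on an assumed fragment with three of the listed agents (`examplebot` is a hypothetical unlisted agent):

```python
from urllib.robotparser import RobotFileParser

# Assumed fragment: three agents from the list sharing one rule.
BLOCKED_GROUP = """\
User-agent: gptbot
User-agent: ccbot
User-agent: claudebot
Disallow: /
"""

block_parser = RobotFileParser()
block_parser.parse(BLOCKED_GROUP.splitlines())

gptbot_ok = block_parser.can_fetch("GPTBot", "https://tupalo.com/")
other_ok = block_parser.can_fetch("examplebot", "https://tupalo.com/")

print(gptbot_ok)  # False: every path is disallowed for the listed agents
print(other_ok)   # True: this fragment has no group that applies to examplebot
```

In the full file an unlisted agent would fall back to the `*` group, which this fragment leaves out.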

Other Records

Field Value
sitemap https://sitemaps.tupalo.com/com_spots_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/com_categories_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/com_tags_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/com_streets_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/com_chains_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/com_medias_sitemap_index_00.xml
sitemap https://sitemaps.tupalo.com/com_cities_sitemap_index_00.xml
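
Sitemap records apply to the whole file, not to a single group. A minimal sketch of listing them with `RobotFileParser.site_maps()` (available from Python 3.8) follows, using an assumed fragment that contains two of the URLs above.

```python
from urllib.robotparser import RobotFileParser

# Assumed fragment; only two of the seven sitemap records are included.
SITEMAP_FRAGMENT = """\
User-agent: *
Disallow: /widgets/

Sitemap: https://sitemaps.tupalo.com/com_spots_sitemap_index_00.xml
Sitemap: https://sitemaps.tupalo.com/com_cities_sitemap_index_00.xml
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_FRAGMENT.splitlines())

# site_maps() returns a list of URLs, or None when no Sitemap record was found.
sitemaps = sitemap_parser.site_maps()
for url in sitemaps or []:
    print(url)
```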

Comments

  • Hey robot! Have a good time at Tupalo.com!

Warnings

  • `noindex` is not a known field.