taposapp.com
robots.txt

Robots Exclusion Standard data for taposapp.com

Resource Scan

Scan Details

Site Domain	taposapp.com
Base Domain	taposapp.com
Scan Status	Ok
Last Scan	2026-02-06T03:29:02+00:00
Next Scan	2026-03-08T03:29:02+00:00

Last Scan

Scanned	2026-02-06T03:29:02+00:00
URL	https://taposapp.com/robots.txt
Domain IPs	104.26.8.106, 104.26.9.106, 172.67.73.151, 2606:4700:20::681a:86a, 2606:4700:20::681a:96a, 2606:4700:20::ac43:4997
Response IP	172.67.73.151
Found	Yes
Hash	e59c40178e8912f4493cdb8313d7d7c6429b03742be150588780f1f328a2cc0c
SimHash	46b44b13c594

Groups

*

Rule	Path
Allow	/

amazonbot

Rule	Path
Disallow	/

applebot-extended

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

meta-externalagent

Rule	Path
Disallow	/

gptbot
claudebot
claude-user
claude-searchbot
ccbot
google-extended
applebot-extended
facebookbot
meta-externalagent
meta-externalfetcher
diffbot
perplexitybot
perplexity‑user
omgili
omgilibot
webzio-extended
imagesiftbot
bytespider
tiktokspider
amazonbot
youbot
semrushbot-ocob
petalbot
velenpublicwebcrawler
turnitinbot
timpibot
oai-searchbot
icc-crawler
ai2bot
ai2bot-dolma
dataforseobot
awariobot
awariosmartbot
awariorssbot
google-cloudvertexbot
pangubot
kangaroo bot
sentibot
img2dataset
meltwater
seekr
peer39_crawler
cohere-ai
cohere-training-data-crawler
duckassistbot
scrapy
cotoyogi
aihitbot
factset_spyderbot
firecrawlagent

Rule	Path
Disallow	/

*

Rule	Path
Allow	/
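
The groups listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a hypothetical reconstruction of a small subset of the groups in this scan, not the site's actual file, and `examplebot` is a made-up agent name used to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a few groups from the scan (not the real file).
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
User-agent: CCBot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from memory, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str,
               url: str = "https://taposapp.com/") -> bool:
    """Return True if the parsed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


parser = build_parser(ROBOTS_TXT)
for agent in ("GPTBot", "CCBot", "examplebot"):
    print(agent, is_allowed(parser, agent))
```

Under these assumptions the two named AI crawlers are refused and the unlisted agent falls through to the `*` group, which allows `/`. Note that `urllib.robotparser` matches agent names by case-insensitive substring, which may differ from how a given crawler interprets the file.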

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a Content-Signal = yes, you may collect content for the corresponding
use.
(b) If a Content-Signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a Content-Signal for a
corresponding use, the website operator neither grants nor restricts
permission via Content-Signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
Block all known AI crawlers and assistants
from using content for training AI models.
Source: https://robotstxt.com/ai
Block any non-specified AI crawlers (e.g., new
or unknown bots) from using content for training
AI models, while allowing the website to be
indexed and accessed by bots. These directives
are still experimental and may not be supported
by all AI crawlers.
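
Rules (a) to (c) in the comments above amount to a three-valued lookup per use: yes, no, or not stated. The following is a minimal sketch, assuming a directive line of the form `Content-Signal: search=yes, ai-train=no`. That line syntax is not shown in this scan and is an assumption here, and `parse_content_signal` is a hypothetical helper name.

```python
from typing import Dict, Optional

# Signal names taken from the comment text in this scan.
KNOWN_SIGNALS = ("search", "ai-input", "ai-train")


def parse_content_signal(line: str) -> Dict[str, Optional[bool]]:
    """Map each known signal to True (yes), False (no), or None (not stated).

    Assumes a line such as 'Content-Signal: search=yes, ai-train=no';
    this syntax is an assumption, not confirmed by the scan.
    """
    result: Dict[str, Optional[bool]] = {name: None for name in KNOWN_SIGNALS}
    field, sep, value = line.partition(":")
    if not sep or field.strip().lower() != "content-signal":
        return result  # not a Content-Signal line: nothing granted or restricted
    for pair in value.split(","):
        key, eq, flag = pair.partition("=")
        key, flag = key.strip().lower(), flag.strip().lower()
        if eq and key in result and flag in ("yes", "no"):
            result[key] = (flag == "yes")
    return result


signals = parse_content_signal("Content-Signal: search=yes, ai-train=no")
print(signals)
```

A value of `None` corresponds to rule (c): the operator neither grants nor restricts permission for that use via Content-Signal.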

Warnings

`content-signal` is not a known field.
`content-usage` is not a known field.
`disallowaitraining` is not a known field.
