piliapp.com
robots.txt

Robots Exclusion Standard data for piliapp.com

Resource Scan

Scan Details

Site Domain piliapp.com
Base Domain piliapp.com
Scan Status Ok
Last Scan2024-11-08T11:13:06+00:00
Next Scan 2024-11-15T11:13:06+00:00

Last Scan

Scanned2024-11-08T11:13:06+00:00
URL https://piliapp.com/robots.txt
Domain IPs 172.66.40.203, 172.66.43.53, 2606:4700:3108::ac42:28cb, 2606:4700:3108::ac42:2b35
Response IP 172.66.40.203
Found Yes
Hash 512333d763c558c15f6f95d1af802250ec85dd6ccc6dbc4c516109733b6dd13a
SimHash 641d535cf000

Groups

*

Rule Path
Disallow /dev-*
Disallow /pili-*
Disallow /tool-*
Disallow /generator/qr-code/apps*
Disallow /tw-railway/result/
Disallow /actual-size/what-is-my-monitor-size/?next_device=*
Disallow /feedkback/
Disallow /lnk/*piliapp.com/*
Disallow /page/language/?app_uri=*

adsbot-google
mediapartners-google

Rule Path
Allow /tw-railway/result/

anthropic-ai
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
gptbot
google-extended
magpie-crawler
newsnow
news-please
omgili
omgilibot
perplexitybot
scrapy
turnitinbot

Rule Path
Disallow /