leonardwong.tech
robots.txt
Robots Exclusion Standard data for leonardwong.tech
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | leonardwong.tech |
Base Domain | leonardwong.tech |
Scan Status | Ok |
Last Scan | 2025-04-18T15:29:48+00:00 |
Next Scan | 2025-05-02T15:29:48+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-04-18T15:29:48+00:00 |
URL | https://leonardwong.tech/robots.txt |
Domain IPs | 104.21.59.171, 172.67.181.93, 2606:4700:3036::ac43:b55d, 2606:4700:3037::6815:3bab |
Response IP | 172.67.181.93 |
Found | Yes |
Hash | 4dfba0ae443c6972e180cda54b694d631c9e2eac922ed189d8c74840c84c9b8f |
SimHash | 5408f941e2a0 |
Groups
*
Rule | Path |
---|---|
Disallow | /resume.docx |
adsbot-google
amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot
Rule | Path |
---|---|
Disallow | / |
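To illustrate how the two groups above interact, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from the scanned groups, not the verbatim file, and it lists only a handful of the named agents for brevity. `ExampleBrowser` is a hypothetical agent name standing in for any crawler that is not listed, and the `allowed` helper is introduced here purely for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scanned groups; NOT the verbatim robots.txt.
# Only a subset of the listed user agents is included (assumption for brevity).
ROBOTS_TXT = """\
User-agent: *
Disallow: /resume.docx

User-agent: anthropic-ai
User-agent: ccbot
User-agent: claudebot
User-agent: gptbot
User-agent: perplexitybot
Disallow: /
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on the scanned site?"""
    return parser.can_fetch(agent, "https://leonardwong.tech" + path)


if __name__ == "__main__":
    checks = [
        ("GPTBot", "/"),                      # named agent, site root
        ("ExampleBrowser", "/"),              # unlisted agent, site root
        ("ExampleBrowser", "/resume.docx"),   # unlisted agent, blocked file
    ]
    for agent, path in checks:
        print(f"{agent:15} {path:15} allowed={allowed(agent, path)}")
```

Under these assumptions, a named agent such as `GPTBot` matches its own group and is refused everything, while an unlisted agent falls back to the `*` group and is refused only `/resume.docx`. Real crawlers may implement matching differently from `urllib.robotparser`, so this is an approximation of the intended policy rather than a guarantee of crawler behavior.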