yes319.com
robots.txt

Robots Exclusion Standard data for yes319.com

Resource Scan

Scan Details

Site Domain yes319.com
Base Domain yes319.com
Scan Status Ok
Last Scan2024-09-21T14:25:24+00:00
Next Scan 2024-09-28T14:25:24+00:00

Last Scan

Scanned2024-09-21T14:25:24+00:00
URL https://yes319.com/robots.txt
Domain IPs 34.80.94.99
Response IP 34.80.94.99
Found Yes
Hash 77364948052fb487a482fb95da6fa8889fa2dd69fe1e744a875ea477a077954c
SimHash 587833362263

Groups

*

Rule Path
Disallow /admin/
Disallow /house/
Disallow /admin257/
Disallow /admin355/
Disallow /cn/
Disallow /000/
Disallow /001/
Disallow /log/
Disallow /api/
Disallow /console/
Disallow /*.xls$
Disallow /*.doc$
Disallow /*.pdf$

baiduspider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

outfoxbot

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

checkmarknetwork/1.0 (+http://www.checkmarknetwork.com/spider.html)

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

ttd-content

Rule Path
Disallow /

admantx

Rule Path
Disallow /

ioncrawl

Rule Path
Disallow /

yeti

Rule Path
Disallow /

Warnings

  • 2 invalid lines.