livejapan.com
robots.txt

Robots Exclusion Standard data for livejapan.com

Resource Scan

Scan Details

Site Domain livejapan.com
Base Domain livejapan.com
Scan Status Ok
Last Scan2024-10-30T17:15:14+00:00
Next Scan 2024-11-06T17:15:14+00:00

Last Scan

Scanned2024-10-30T17:15:14+00:00
URL https://livejapan.com/robots.txt
Domain IPs 103.129.28.150
Response IP 103.129.28.150
Found Yes
Hash d7986bb637eec76208fbce1b188d4bd74cfc2cd166fe42e1f6d6d6719e60b0f7
SimHash 0d5d2d58e38b

Groups

*

Rule Path
Disallow /mobile/
Disallow /adultmovies/
Disallow /adultmovies/home.html
Disallow /m/
Disallow /bbs/
Disallow /adultmovies/bbs/
Disallow /*?utm_id=lj_teaser201602_pc_g_lis
Disallow /*print.html
Disallow /en/list/
Disallow /ja/list/
Disallow /ko/list/
Disallow /zh-cn/list/
Disallow /zh-tw/list/
Disallow /ms/list/
Disallow /id/list/
Disallow /th/list/

baiduspider
baiduspider+
baiduimagespider

Rule Path
Disallow /ajax/
Disallow /en/ajax/
Disallow /id/ajax/
Disallow /ja/ajax/
Disallow /ko/ajax/
Disallow /ms/ajax/
Disallow /th/ajax/
Disallow /zh-cn/ajax/
Disallow /zh-tw/ajax/
Disallow /en/list/
Disallow /ja/list/
Disallow /ko/list/
Disallow /zh-cn/list/
Disallow /zh-tw/list/
Disallow /ms/list/
Disallow /id/list/
Disallow /th/list/

adsbot-google
adsbot-google-mobile

Rule Path
Disallow /*print.html