jausa.ja.org
robots.txt

Robots Exclusion Standard data for jausa.ja.org

Resource Scan

Scan Details

Site Domain jausa.ja.org
Base Domain ja.org
Scan Status Ok
Last Scan2024-10-22T01:11:53+00:00
Next Scan 2024-11-21T01:11:53+00:00

Last Scan

Scanned2024-10-22T01:11:53+00:00
URL https://jausa.ja.org/robots.txt
Domain IPs 52.26.63.57, 52.27.206.47, 52.33.79.69
Response IP 52.27.206.47
Found Yes
Hash e69f0786414415cdc145b0b0949591766d468a84fa48dc840d81aa61b2a94374
SimHash a705c873de13

Groups

pagefreezer
petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /docs/webdav/autopub/*
Disallow /webdav/*
Disallow /api/*
Disallow /html/*
Disallow /*/javadocs/*
Disallow /javadocs/*
Disallow /rensai/*
Disallow /docs/1.*
Disallow /docs/2.*
Disallow /docs/3.*
Disallow /docs/4.*
Disallow /servlets/*
Disallow /download/nightly-builds
Disallow /download/nightly-builds*
Disallow /accelerators/*
Disallow /shop/*
Disallow /doc/dotmarketing/*
Disallow /docs/com/dotmarketing/*
Disallow /browse/*
Disallow /devwiki/*
Disallow /school/*
Disallow /person/*

Other Records

Field Value
sitemap https://jausa.ja.org/api/vtl/sitemap