his-j.com
robots.txt

Robots Exclusion Standard data for his-j.com

Resource Scan

Scan Details

Site Domain his-j.com
Base Domain his-j.com
Scan Status Ok
Last Scan 2024-09-21T07:49:29+00:00
Next Scan 2024-09-28T07:49:29+00:00

Last Scan

Scanned 2024-09-21T07:49:29+00:00
URL https://his-j.com/robots.txt
Redirect https://www.his-j.com:443/robots.txt
Redirect Domain www.his-j.com
Redirect Base his-j.com
Domain IPs 13.112.2.54, 35.78.23.105
Redirect IPs 23.203.73.8
Response IP 59.151.128.80
Found Yes
Hash a74332f508b5f62bb3b94c25fa9791b9523c3c036e54edce7d45f3a362d33fea
SimHash 2d0df84a0b14

Groups

*

Rule Path
Disallow /*.pdf$
Disallow /*.xls$
Disallow /*.xlsx$
Disallow /*.doc$
Disallow /*.docx$
Disallow /json.asp
Disallow /hotel/json.asp
Disallow /js/cal/search_form.html
Disallow /kix/*/tour_search/
Disallow /kix/executive/tour_search.html
Disallow /*ah_iframe.html
Disallow /okinawa/op/step1/
Disallow /okinawa/op/stock/
Disallow /tyo/air/sale/airhtl_form/
Disallow /tyo/air_hotel/airhtl_form/
Disallow /tyo/air_hotel/widget/
Disallow /tyo/special/etihad/wigget/
Disallow /ngo/air_hotel/airhtl_form/
Disallow /ngo/air_hotel/airhtl_form_fair/
Disallow /ngo/area/*/parts/airhtl_form/search.html
Disallow /ngo/special/zekkei/search/air_pc.html
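Several of the paths above use the `*` wildcard and the `$` end-of-path anchor. As a minimal sketch of how such patterns are commonly interpreted (following the matching rules described in RFC 9309, not necessarily how any particular crawler implements them), the snippet below converts a Disallow path into a regular expression and tests a few URL paths. The helper names `rule_to_regex` and `is_disallowed` and the sample URL paths are hypothetical and chosen for illustration; only the patterns in `RULES` come from the table. Because this group contains only Disallow rules, Allow/Disallow precedence is not modelled.

```python
import re

def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the URL path. Without '$' the pattern is
    a prefix match.
    """
    anchored = path_pattern.endswith("$")
    core = path_pattern[:-1] if anchored else path_pattern
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(url_path: str, disallow_patterns: list[str]) -> bool:
    """Return True if any Disallow pattern matches the URL path."""
    return any(rule_to_regex(p).match(url_path) for p in disallow_patterns)

# A subset of the Disallow paths listed in the table above.
RULES = ["/*.pdf$", "/json.asp", "/kix/*/tour_search/", "/*ah_iframe.html"]

# Hypothetical URL paths, used only to exercise the patterns.
for path in ["/docs/guide.pdf", "/docs/guide.pdf?x=1", "/json.asp?id=3",
             "/kix/hawaii/tour_search/list.html", "/hotel/index.html"]:
    print(path, is_disallowed(path, RULES))
```

Note that `/*.pdf$` blocks only paths that end in `.pdf`, so a path with a query string after the extension is not matched by that rule, while `/json.asp` has no anchor and therefore matches any path that starts with it.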

Other Records

Field Value
sitemap https://www.his-j.com/sitemap.xml
sitemap https://www.his-j.com/kaigai/air/sitemap.xml
sitemap https://www.his-j.com/corp/sitemap.xml
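The sitemap records above can be read programmatically from a robots.txt file. The following is a small sketch using Python's standard `urllib.robotparser` module (its `site_maps()` method requires Python 3.8 or later). The `ROBOTS_TXT` string is an assumed reconstruction from the records in this report, not the verbatim file served by the site, and it includes only one Disallow line for brevity.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the file, based on the records above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /json.asp

Sitemap: https://www.his-j.com/sitemap.xml
Sitemap: https://www.his-j.com/kaigai/air/sitemap.xml
Sitemap: https://www.his-j.com/corp/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap lines are present.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```

The standard-library parser does not interpret the `*` and `$` wildcards in paths, so it is used here only to collect the sitemap URLs.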