unicode-table.com
robots.txt

Robots Exclusion Standard data for unicode-table.com

Resource Scan

Scan Details

Site Domain unicode-table.com
Base Domain unicode-table.com
Scan Status Ok
Last Scan 2024-09-21T04:02:04+00:00
Next Scan 2024-09-28T04:02:04+00:00

Last Scan

Scanned 2024-09-21T04:02:04+00:00
URL https://unicode-table.com/robots.txt
Redirect https://symbl.cc/robots.txt
Redirect Domain symbl.cc
Redirect Base symbl.cc
Redirect IPs 138.201.57.225
Response IP 138.201.57.225
Found Yes
Hash 2b61b51bbd1b081ac624630c8d6983895d77225b1f4a94fe4c21ec529129dcbe
SimHash 09d0900afb53

Groups

*

Rule Path
Allow /
Disallow /*/types/
Disallow /*/versions/
Disallow /*/about/
Disallow /*/search/
Disallow /*?*
Disallow /*/planes/
Disallow /*/countries/

googlebot

Rule Path
Disallow /*?*
Disallow /*/countries/
Disallow /*/planes/
Disallow /*/types/
Disallow /*/versions/
Disallow /*/about/
Disallow /*/search/
Disallow /*?*path*
Disallow /*?*ivk_sa*
Disallow /*?*hl*
Disallow /*?*ref*
Disallow /*?*nomobile*
Disallow /*?*msclkid*
Disallow /*?*_ym_debug*

yandex

Rule Path
Allow /
Disallow /*?*
Disallow /*/countries/
Disallow /*/planes/
Disallow /*/types/
Disallow /*/versions/
Disallow /*/about/
Disallow /*/search/
Disallow /cn/
Disallow /hi/
Disallow /kr/
Disallow /jp/
Disallow /th/
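
The groups above rely on `*` wildcards, so a rough sketch of how such rules might be evaluated can help when reading them. The Python below is only an illustration under stated assumptions: the helper names `pattern_to_regex` and `is_allowed` are hypothetical, the rule list is copied from the `*` group, and precedence follows the longest-pattern rule of RFC 9309 with Allow winning ties. Individual crawlers may evaluate rules differently.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the path's beginning.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path: str) -> bool:
    """Return True if `path` may be fetched under `rules`.

    Assumed precedence: the longest matching pattern wins, and Allow
    wins a tie. A path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, best_allow = len(pattern), is_allow
    return best_allow

# Rules copied from the `*` group listed above.
STAR_GROUP = [
    ("allow", "/"),
    ("disallow", "/*/types/"),
    ("disallow", "/*/versions/"),
    ("disallow", "/*/about/"),
    ("disallow", "/*/search/"),
    ("disallow", "/*?*"),
    ("disallow", "/*/planes/"),
    ("disallow", "/*/countries/"),
]

if __name__ == "__main__":
    # The sample paths are made up for illustration.
    for sample in ["/en/0041/", "/en/search/", "/en/0041/?ref=x"]:
        print(sample, is_allowed(STAR_GROUP, sample))
```

Under these assumptions, any path carrying a query string falls under `Disallow /*?*`, because that pattern is longer than the catch-all `Allow /`.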

Warnings

  • `clean-param` is not a known field. It is a Yandex-specific extension, not part of the Robots Exclusion Protocol.