- May 25, 2013
-
-
Raymond Toy authored
functions now live in the new UNICODE package. src/code/exports.lisp:: * Export some unicode functions and constants. src/code/string.lisp:: * Removed the extended versions of string-upcase and friends. * Export surrogates function. * Make sure with-one-string is defined so the unicode package can use it. src/code/unicode.lisp:; * New file with extended versions of string-upcase and friends. src/code/unidata.lisp:: * Export some unicode functions and constants. src/compiler/fndb.lisp:: * Update defknowns for string-upcase and friends. src/tools/worldbuild.lisp:: * Build unicode.lisp src/tools/worldcom.lisp:: * Load unicode.lisp
-
- Mar 05, 2013
-
-
Raymond Toy authored
Still needs work because the word-break tests fail.
-
- Jan 17, 2013
-
-
Raymond Toy authored
Change *unidata-path* to be a pathname object instead of a namestring.
-
- Nov 18, 2012
-
-
Raymond Toy authored
but the old version could. This unfixes Trac #52.
-
- Mar 03, 2012
-
-
Raymond Toy authored
signaling an error.
-
- Feb 05, 2012
-
-
Raymond Toy authored
src/code/unidata.lisp:: Update Unicode version. src/tools/build-unidata.lisp:: Update Unicode version and update for change of format of NameAliases.txt. src/i18n/unidata.bin:: Updated with new data. src/general-info/release-20d.txt:: Updated. src/i18n/BidiMirroring.txt:: Updated to Unicode 6.1.0. src/i18n/CaseFolding.txt:: Updated to Unicode 6.1.0. src/i18n/CompositionExclusions.txt:: Updated to Unicode 6.1.0. src/i18n/DerivedNormalizationProps.txt:: Updated to Unicode 6.1.0. src/i18n/NameAliases.txt:: Updated to Unicode 6.1.0. src/i18n/NormalizationCorrections.txt:; src/i18n/SpecialCasing.txt:: Updated to Unicode 6.1.0. src/i18n/UnicodeData.txt:: Updated to Unicode 6.1.0. src/i18n/WordBreakProperty.txt:: Updated to Unicode 6.1.0. src/i18n/tests/NormalizationTest.txt:: Updated to Unicode 6.1.0. src/i18n/tests/WordBreakTest.txt:: Updated to Unicode 6.1.0.
-
- Feb 01, 2012
-
-
Raymond Toy authored
list.
-
Raymond Toy authored
Thanks to Paul Foley for rewriting {{{UNICODE-COMPLETE-NAME}}} to make it work.
-
- Nov 04, 2011
-
-
Raymond Toy authored
-
- Sep 25, 2011
-
-
Raymond Toy authored
entries with just the file path, removing the revision number, date, author and state. The actual information is now computed during compilation and stored in the fasl itself. (See ticket:48.)
-
- Jun 27, 2011
-
-
rtoy authored
code/unidata.lisp: o Update unicode version to 6.0.0 o Add pointer to build-unidata.lisp. tools/build-unidata.lisp: o Update unicode version to 6.0.0 o Print out directory path so we can see where we're getting the data from. i18n/CaseFolding.txt i18n/CompositionExclusions.txt i18n/DerivedNormalizationProps.txt i18n/NameAliases.txt i18n/NormalizationCorrections.txt i18n/SpecialCasing.txt i18n/UnicodeData.txt i18n/WordBreakProperty.txt i18n/tests/NormalizationTest.txt i18n/tests/WordBreakTest.txt: o Update with new files from unicode.org.
-
- Jun 10, 2011
-
-
rtoy authored
This makes it easy to make an executable image that doesn't need unidata.bin around. (Should we do this for normal cores? It seems to add about 1 MB to the core size.) code/unidata.lisp: o Add LOAD-ALL-UNICODE-DATA to load all unicode data. o Add UNICODE-DATA-LOADED-P to check that unicode data has been loaded. code/print.lisp: o If unicode data is loaded, don't check for existence of *unidata-path*, because we don't need it. code/exports.lisp: o Export LOAD-ALL-UNICODE-DATA. general-info/release-20c.txt: o Update info
-
- May 31, 2011
-
-
rtoy authored
This change requires a cross-compile. Use boot-2011-04-01-cross.lisp as the cross-compile script. bootfiles/20b/boot-2011-04-01-cross.lisp: o New cross-compile bootstrap file lisp/lisp.c: o Recognize -unidata option and setup *UNIDATA-PATH* appropriately. code/commandline.lisp: o Add defswitch for unidata so we don't get complaints about unknown switch. code/unidata.lisp: o Rename +UNIDATA-PATH+ to *UNIDATA-PATH*, since it's not a constant anymore. o Update code to use new name. code/print.lisp: o Update code to use *UNIDATA-PATH* compiler/sparc/parms.lisp: o Add *UNIDATA-PATH* to list of static symbols. o Add back in spare-9 and spare-8 static symbols since we need to do a cross-compile for this change anyway. compiler/x86/parms.lisp: o Add *UNIDATA-PATH* to list of static symbols. o Reorder the static symbols in a more logical arrangment so that the spare symbols are at the end. i18n/local/cmucl.pot: o Update
-
- Apr 02, 2011
-
-
rtoy authored
-
- Feb 23, 2011
-
-
rtoy authored
#\latin_small_letter_a. This failure is caused by the new SEARCH-DICTIONARY function that does partial completion, and UNICODE-NAME-TO-CODEPOINT function wan't aware of the new way. We could change UNICODE-NAME-TO-CODEPOINT to do the appropriate thing with the new way, but I (rtoy) decided it would be nice to have the old function around too. Hence, restore the old version and use it.
-
- Sep 29, 2010
-
-
rtoy authored
as the key for the composition table. That way the logic is in exactly one place and not spread out through the code.
-
- Sep 21, 2010
-
-
rtoy authored
original completions along with the extensions.
-
- Sep 20, 2010
-
-
rtoy authored
case we were returning the prefix string, but that would be incorrect if the prefix string is not a valid character. So check that it is valid and return it. Otherwise do nothing (thereby returning nil) so slime can note the character is invalid.
-
rtoy authored
more and fix some bugs in previous change.
-
- Sep 19, 2010
-
-
rtoy authored
(because they're so simple). o Add some comments for %STR. o Change implementation of %MATCH to be simpler and add comments on why we do what we do and explain what happens if we don't. o Handle completion of Hangul syllables better: - Match "Hangul_S" instead of "Hangul_Syllable" because there's #\Hangul_Single_Dot_Tone_Mark. - If we match "Hangul_S", try to complete some Hangul syllables so we don't fool slime into thinking "Hangul_Syllable_" is the only completion. There are obviously more. o Handle completion of CJK Unified Ideographs better by trying to complete more so slime isn't fooled into thinking "CJK_Unified_Ideograph-" is the only possible completion.
-
rtoy authored
the constraints on the codebook, we just sort them in descreasing order of length. o In %MIP, it might happen that MISMATCH returns NIL, which means a match. In this case, don't change the position.
-
- Sep 18, 2010
-
-
rtoy authored
Redo this by looping over all codepoints and selecting the codepoints that are Hangul syllables.
-
rtoy authored
o Update constants to Unicode version 5.2.0. i18n/unidata.bin: o Regenerated using Unicode version 5.2.0.
-
rtoy authored
o Just add some comments on why we don't put the dictionaries in unidata.bin. o Print out some messages when building the hangul and cjk dictionaries so the user knows what's happening. tools/build-unidata.lisp: o Add some comments on the various parts of unidata.bin.
-
- Sep 17, 2010
-
-
rtoy authored
o Export STRING-TO-NFC, UNICODE-COMPLETE, and UNICODE-COMPLETE-NAME. unidata.lisp: o Add explicit exports.
-
rtoy authored
ideographs by using dictionaries. (Should these dictionaries be part of unidata.bin so they don't have to be built at run time? One the one hand, it makes things simpler, but unnecessarily bloats unidata.in. I suspect the hangul syllables and cjk ideographs characters not not used very often.) o Change NODE-NEXT and CLOSE-NODE to have an optional parameter for the dictionary to use. o Update UNICODE-COMPLETE-NAME to pass the dictionary to NODE-NEXT and CLOSE-NODE. o Update UNICODE-COMPLETE to use the hangul syllable dictionary and the cjk ideograph dictionary when searching. o Fix typo in UNICODE-COMPLETE. o Add defvars for dictionaries for hangul syllables and cjk ideographs. o Add functions to build the hangul and cjk dictionaries. o Steal the implementations of BUILD-DICTIONARY, NAME-LOOKUP, and ENCODE-NAME from tools/build-unidata.lisp.
-
rtoy authored
support character completion for slime. The implementation is from Paul Foley, but some slight modifications by Raymond Toy to handle a few corner cases. o Modify SEARCH-DICTIONARY to take optional current and posn parameters so that SEARCH-DICTIONARY can be started from a different place. o Add UNICODE-COMPLETE, which is the main function for character name completion. o Add other support functions for UNICODE-COMPLETE.
-
rtoy authored
hangule-syllable-p.) o Move the computation of *reverse-hangule-choseong*, *reverse-hangul-jungseong*, and *reverse-hangul-jongseong* to its own routine. Call it in UNICODE-NAME-TO-CODEPOINT.
-
- Sep 15, 2010
-
-
rtoy authored
put the tests into their own functions so that the limits are on one place.
-
rtoy authored
code/string.lisp: o In %compose, handle the case where the composite character is outside the BMP and thus needs special handling for our UTF-16 strings. code/unidata.lisp o CKJ Ideograph range has changed in 5.2. o Fix bug in build-composition-table. We were not correctly handling the case where the decomposition of a codepoint was outside the BMP. Special care is needed to handle the UTF-16 strings that we use. o The key for the pairwise composition table are the full codepoints, so we need to shift one by 21 bits instead of 16. tools/build-unidata.lisp o Update minor version to 2. i18n/BidiMirroring.txt i18n/CaseFolding.txt i18n/CompositionExclusions.txt i18n/DerivedNormalizationProps.txt i18n/NameAliases.txt i18n/NormalizationCorrections.txt i18n/SpecialCasing.txt i18n/UnicodeData.txt i18n/WordBreakProperty.txt i18n/tests/NormalizationTest.txt i18n/tests/WordBreakTest.txt o Updated from Unicode 5.2. i18n/unidata.bin o Regenerated from new Unicode 5.2 files.
-
- Apr 20, 2010
-
-
rtoy authored
may get confused with source locations if the reader macros are installed.
-
- Mar 19, 2010
-
-
rtoy authored
boot-2010-02-1 as the bootstrap file. You should probably also use the new -P option for build.sh to generate and update the po files while building.
-
- Sep 11, 2009
-
-
rtoy authored
o Add support for word break properties. o Some cleanup of the code including moving the common code in write-ntrie* to write-ntrie. code/unidata.lisp: o Add support for word break properties. o UNICODE-WORD-BREAK-CODE and UNICODE-WORD-BREAK return the property code and the property keyword for a codepoint, respectively. i18n/WordBreakProperty.txt: o New file for the word break properties.
-
- Jul 10, 2009
-
-
rtoy authored
o Add *unidata-version* to hold our revision number. save.lisp: o Add Unicode to the herald items. Just print out the unidata version along with the supported Unicode UCD version.
-
- Jul 02, 2009
-
-
rtoy authored
o Bootstrap file needed to compile this change (because the current shrink-vector derive-type optimizer didn't handle union types). compiler/fndb.lisp: o Make the compiler warn if the result of lisp::shrink-vector is not used. This is a problem because the compiler doesn't know that shrink-vector destructively modifies the length of a vector. As a partial solution, warn the user if the result of shrink-vector is not. code/hash-new.lisp: code/seq.lisp: o Make sure the result of shrink-vector is used, to get rid of a new compiler warning. code/unidata.lisp: o Modify %unicode-full-case so that it doesn't use shrink-vector anymore. compiler/seqtran.lisp: o Fix shrink-vector derive-type optimizer to handle union types. tools/build-unidata.lisp: o Fix typo that someone got in. o Make sure the result of shrink-vector is used, to get rid of a new compiler warning.
-
- Jun 16, 2009
-
-
rtoy authored
o Only define STRING-TO-NFD, STRING-TO-NFKD, and STRING-TO-NFKC for Unicode builds. Conditionalize out their support functions too. o Update export list to be conditional on Unicode too. o Use new name for get-pairwise-composition. code/exports.lisp: o Update export list to be conditional on Unicode for above changes in string.lisp. code/unidata.lisp: o Change name from GET-PAIRWISE-COMPOSITION to UNICODE-PAIRWISE-COMPOSITION to match other Unicode function names.
-
- Jun 11, 2009
-
-
rtoy authored
unicode-utf16-extfmt-2009-06-11.
-