Parent Directory | Revision Log
|Links to HEAD:||(view) (annotate)|
|Links to intl-2-branch:||(view) (annotate)|
Restart internalization work. This new branch starts with code from the intl-branch on date 2010-02-12 18:00:00+0500. This version works and LANG=en@piglatin bin/lisp works (once the piglatin translation is added).
Merge changes from unicode-string-buffer-impl-branch which gives faster reads on external-formats. This is done by adding an additional buffer to streams so we can convert the entire in-buffer into characters all at once. To build this change, you need to do a cross-compile using boot-2009-10-1-cross.lisp. Using that build, do a normal build with these sources. For a non-unicode build use boot-2009-10-01.lisp with a 20a non-unicode build. code/extfmts.lisp: o Add another slot to the extfmts for copying the state. o Modify EF-OCTETS-TO-STRING and OCTETS-TO-STRING to support the necesssary changes for fast formats. This is incompatible with the previous version because the string is not grown if needed. code/fd-stream-extfmt.lisp: o Set *enable-stream-buffer-p* to T so we have fast streams. code/fd-stream.lisp: o Add new slots to support fast strams. o In SET-ROUTINES, initialize the new slots appropriately. o Update UNREAD-CHAR to be able to back up in the string buffer to unread. o Add implementation to copy the state of an external format. code/stream.lisp: o Change %SET-FD-STREAM-EXTERNAL-FORMAT to be able to change formats even if we've already converted the buffer with a different format. We reconvert the buffer with the old format until we reach the current character. Then the remaining octets are converted using the new format and stored in the string buffer. o Add FAST-READ-CHAR-STRING-REFILL to refill the string buffer, like FAST-READ-CHAR-REFILL does for the octet in-buffer. code/struct.lisp: o Add new slots to hold the string buffer, the current index, and length. These are needed for the fast formats. code/sysmacs.lisp: o Update PREPARE-FOR-FAST-READ-CHAR, DONE-WITH-FAST-READ-CHAR, and FAST-READ-CHAR to support the string buffer. code/string.lisp: o Microoptimization of SURROGATEP to reduce the number of branchs. general-info/release-20b.txt: o Update with these changes pcl/simple-streams/external-formats/utf-16-be.lisp: pcl/simple-streams/external-formats/utf-16-le.lisp: pcl/simple-streams/external-formats/utf-16.lisp: o These formats actually have state, so update them to take a handle an initial state. These are needed if the string buffer ends with a leading surrogate and the next string buffer starts with a trailing surrogate. The conversion needs to combine the surrogates together.
Add docstring for SET-SYSTEM-EXTERNAL-FORMAT.
Merge changes from the release-20a branch.
Merge changes from 20a branch.
Merge changes from 20a-pre1 (tag release-20a-pre1) to trunk.
Add support for flushing out any state in an external format when closing an output stream. This causes things like (with-open-file (s "foo" :direction :output :external-format :utf-8) (write-char #\u+d800 s)) to output the replacement character instead of creating an empty file. code/extfmts.lisp: o Add new slot to efx structure to hold the function to flush the state in an external format. o Add accessor for the flush-state slot. o Update DEFINE-EXTERNAL-FORMAT to allow specifying the flush function. o Add macro to call the flush-state function. o Added +EF-FLUSH+ o Use vm::defenum to name the constants instead of the hand-written values. o Export +REPLACEMENT-CHARACTER-CODE+ o Document the slots in an efx stucture. code/fd-stream.lisp: o Add ef-flush def-ef-macro to flush the state of an external format when closing an output file. If ef-flush-state is NIL, we just call EF-COUT to send out the replacement character. Otherwise, the flush-state function is called to handle it. o When closing an output character stream, call ef-flush to flush any state before flushing the buffers of the stream. o Document the unicode slots in an fd-stream. code/exports.lisp: o Export +REPLACEMENT-CHARACTER-CODE+
Illegal surrogate sequences (leading surrogate without trailing surrogate or a lone trailing surrogate) get replaced with the replacement character.
o Put some comments back in. o Put back some unicode/unicode-bootstrap conditionals.
Compile and load the external format code.
Fixes from Paul Foley: o Standard streams no longer change formats when *default-external-format* changes. Use stream:set-system-external-format instead, or (setf external-format). o char-to-octets properly handles surrogate characters being written. o Makes simple-streams work again. This change needs to be cross-compiled. 2009-07 binaries work for cross-compiling using the 19e/boot-2008-05-cross-unicode-*.lisp cross-compile script.
code/extfmts.lisp: o Move the +ss-ef-foo+ constants to here from strategy.lisp, and update them so they don't overlap with existing +ef-foo+ constants. o Update +ef-max+ accordingly. pcl/simple-streams/impl.lisp: o Use +ss-ef-str+ instead of +ef-str+ in simple-stream-strlen. pcl/simple-streams/strategy.lisp: o Comment out +ss-ef-foo+ constants. o Use +ef-max+ instead of +ss-ef-max+, which is no longer defined. o Fix bugs in %dc-write-chars-fn: - Use ef variable - Need to call flush-out-buffer, not flush-buffer for dual-channel streams.
Update STRING-ENCODE and STRING-DECODE to handle surrogate pairs correctly. Previously, each surrogate was converted individually. This is wrong; they should be treated as a single codepoint that is converted.
Fix FILE-STRING-LENGTH to handle unicode streams. From Paul.
Fix bug introduced when converting tables from hashes to ntries. From Paul Foley. This makes mac-roman and derived external formats work once again.
Merge Unicode work to trunk. From label unicode-utf16-extfmt-2009-06-11.
Update from Paul: I've moved some slots out of external-format so they can be shared between external-formats that are identical in all but some variables. Also fixed a bug in octets-to-string that made it return one character short, and used char-code-limit instead of #x100 to determine when octets-to-char returns a "?", so now it'll work without change on 8 or 16 bit lisps.
Create a new search-list "ext-formats" that is initialized to "library:ext-formats/". This makes it easier to add new directories where external formats can be found. The previous use made it difficult because the formats had to be in the subdirectory ext-formats. save.lisp: o Create and initialize new search-list. extfmts.lisp: o Use the new search-list instead of "library:ext-formats/".
New external format stuff from Paul. bootfiles/19e/boot-2008-06-1.lisp: o Use this bootfile to compile the change in external-format structure. Just needed to get rid of a restart when compiling pcl. code/exports.lisp: o Renames ENCODE-STRING to STRING-ENCODE. Similarly for DECODE-STRING. code/extfmts.lisp: pcl/simple-streams/impl.lisp: pcl/simple-streams/strategy.lisp: pcl/simple-streams/external-formats/iso8859-1.lisp: pcl/simple-streams/external-formats/utf-8.lisp: pcl/simple-streams/external-formats/void.lisp: o Updated for new external format. I think the main change is not having to do a funcall for each character. pcl/simple-streams/external-formats/aliases o New file describing different names for external formats. pcl/simple-streams/external-formats/crlf.lisp: o New file for composing external format for CR/LF pcl/simple-streams/external-formats/utf-16-be.lisp: pcl/simple-streams/external-formats/utf-16-le.lisp: o New files supporting UTF-16 BE and LE formats. tools/make-main-dist.sh: o Copy over the new files and the aliases file too.
Update from Paul Foley. o Disable package errors when loading up external formats. o A minor patch allowing string-to-octets and vice versa to write into a preallocated array (though they might still allocate a bigger one if necessary), o Fix up any confusion between simple-base-string and simple-string so that nothing breaks when/if they're not the same.
Import Paul Foley's external-formats support. New files: o code/extfmts.lisp o pcl/simple-streams/external-formats/iso8859-1.lisp o pcl/simple-streams/external-formats/void.lisp code/exports.lisp: o Export the new symbols STRING-TO-OCTETS, OCTETS-TO-STRING, *DEFAULT-EXTERNAL-FORMAT*, ENCODE-STRING, and DECODE-STRING from the STREAM package o Make the symbols in the EXT package too. pcl/simple-streams/internal.lisp: o Move the implementation of STRING-TO-OCTETS and friends to a new file (extfmts.lisp). pcl/simple-streams/external-formats/utf-8.lisp: o New implementation. tools/make-main-dist.sh: o Create new target directory to hold external formats o Copy all of the external formats to the new directory. tools/pclcom.lisp: o Compile new code tools/worldcom.lisp: o Compile code/extfmts.lisp tools/worldload.lisp: o Load code/extfmts.lisp
This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, select a symbolic revision name using the selection box, or choose 'Use Text Field' and enter a numeric revision.
|Powered by ViewVC 1.1.5|