UYIP
The Mailing List for
Understanding Yiddish Information Processing

UYIP@THEWORLD.COM

To subscribe to UYIP send E-Mail to uyip-request@theworld.com with a blank subject and containing just the word subscribe. After you subscribe, you'll be sent the UYIP info file via E-Mail with additional details.


Guides


General Yiddish Information Processing Resources


Yiddish Documents

Here are some documents in Yiddish and English to accompany the UYIP mailing list.

Kinds of documents currently supplied are:



  1. MacOS Hebrew Plain Text - These documents are encoded using the MacOS Hebrew character set. In order to view them, you need to use Netscape on the Macintosh, set document encoding to User-defined, go to General Preferences, choose the fonts preferences dialog, choose user-defined under "for the encoding", then choose one of the fonts included in the Apple Hebrew Language Kit, e.g., corsiva, new peninim, etc. Refer to ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/APPLE/HEBREW.TXT for the set of characters in MacOS Hebrew, along with a mapping for each character in Unicode designed to achieve 100% roundtrip compatibility. (Note: this method is generally not suitable for creating usable Unicode documents, i.e., one-way mapping.)

    Note: make sure to choose a font encoded according to the Macintosh Hebrew character set in its entirety -- many/most Hebrew fonts for the Mac don't comply: in particular, they do not properly encode the double-yod with patah ("pasekh-tsvey-yudn") or rafe ("dekhele") characters. All of the fonts shipped by Apple with the Hebrew Language Kit are fine.



  2. Windows Hebrew Plain Text - These documents are encoded using the Windows Hebrew character set (also known as Code Page 1255, or CP1255). There are currently few browsers that properly display Yiddish. Stay tuned. Refer to ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1255.TXT for the set of characters in Windows Hebrew, along with a mapping for each character in Unicode designed to achieve 100% roundtrip compatibility. (Note: Windows 1255 maps quite well to Unicode, especially for Yiddish. There is practically a one-to-one correspondence between their Hebrew characters, including those used for Yiddish.)



  3. GIF Files - standard Compuserve and Web image file format. Viewable on Netscape and just about any other kind of browser. Binary data, not text of any kind. (However, in most cases, these just illustrate the way text is supposed to look; i.e., these provide a "picture" of text.)

  4. Unicode HTML files - Web text pages encoded in Unicode, primarily in the Unicode Hebrew block (Range 0590..05FF). The text is in HTML, that is, some version along the lines of the latest emerging Internet standards, namely the proposal for Internationalization of HTML, which is incorporated into the W3C's HTML 4.0 specification. Yiddish Unicode HTML files should be viewable with any web browser that supports all of the Hebrew characters of Unicode version 1.1 or later. Get one of the available Yiddish-capable web browsers to view a page in Unicode HTML files.

  5. Miscellaneous HTML files - presumably containing no special or interesting encodings, and viewable with any old graphic Web browser -- may have links to other, more interesting documents.

Shop for Yiddish-capable software, etc., at The Yiddish Voice Store Our sponsor: The Yiddish Voice Store sells Yiddish-capable software, plus Yiddish and Klezmer CDs, Yiddish books and videos

Unless otherwise noted, this document and all documents at this site Copyright (c) 2003 - 2009 to: Mark H. David (E-Mail: mhd@yv.org), All Rights Reserved.