From 7bd0a9eb2637a134ae5abdac77dcb9fc1622cfe4 Mon Sep 17 00:00:00 2001 From: John Terpstra Date: Fri, 15 Apr 2005 16:49:53 +0000 Subject: Adding convmv info suggested by Volker Lendecke. (This used to be commit 6c893e17b9a4f3b328377328d25a27a96db9f15d) --- docs/Samba-Guide/SBE-UpgradingSamba.xml | 38 +++++++++++++++++++++++++++++++++ 1 file changed, 38 insertions(+) (limited to 'docs/Samba-Guide') diff --git a/docs/Samba-Guide/SBE-UpgradingSamba.xml b/docs/Samba-Guide/SBE-UpgradingSamba.xml index 0100a34aa5..5f9fca2460 100644 --- a/docs/Samba-Guide/SBE-UpgradingSamba.xml +++ b/docs/Samba-Guide/SBE-UpgradingSamba.xml @@ -462,6 +462,7 @@ Paths: + It is important that both the &smb.conf; file and the secrets.tdb should be backed up before attempting any upgrade. The secrets.tdb file is version encoded and therefore a newer version may not work with an older version of Samba. A backup @@ -470,6 +471,43 @@ Paths: + + International Language Support + + + unicode + character set + codepage + internationalization + Samba-2.x had not support for Unicode, instead all national language character set support in file names + was done using particular locale codepage mapping techniques. Samba-3 supports Unicode in file names, thus + providing true internationalization support. + + + + 8-bit + Non-English users whose national language character set has special characters and who upgrade naively will + find that many files that have the special chracters in the file name will see them garbled and jumbled up. + This typically happens with umlauts and accents because these characters were particular to the codepage + that was in use with Samba-2.x using an 8-bit encoding scheme. + + + + UTF-8 + Files that are created with Samba-3 will use UTF-8 encoding. Should the file system ever end up with a + mix of codepage (unix charset) encoded file names and UTF-8 encoded file names, the mess will take some + effort to set straight. + + + + convmv + A very helpful tool is available from Bjorn Jacke's convmv + work. Convmv is a tool that can be used to convert file and directory names from one encoding method to + another. The most common use for this tool is to convert locale encoded files to UTF-8 Unicode encoding. + + + + -- cgit