How to export from Word to Processing

Hello all,

I exported a Word Document to a text document.

When I load the text document in processing though, special signs such as äöü are destroyed.

Since my question:

How to export from word to processing?

I use ActiveDocument.SaveAs2 FileName:=newName, FileFormat:=wdFormatUnicodeText, ....

Word offers a lot of options as shown below.

Which one is correct?

Thanks a lot!

Warm regards,

Chrisir

wdFormatDocument 	0 	Microsoft Office Word 97 - 2003 binary file format.
wdFormatDOSText 	4 	Microsoft DOS text format.
wdFormatDOSTextLineBreaks 	5 	Microsoft DOS text with line breaks preserved.
wdFormatEncodedText 	7 	Encoded text format.
wdFormatFilteredHTML 	10 	Filtered HTML format.
wdFormatFlatXML 	19 	Open XML file format saved as a single XML file.
wdFormatFlatXMLMacroEnabled 	20 	Open XML file format with macros enabled saved as a single XML file.
wdFormatFlatXMLTemplate 	21 	Open XML template format saved as a XML single file.
wdFormatFlatXMLTemplateMacroEnabled 	22 	Open XML template format with macros enabled saved as a single XML file.
wdFormatOpenDocumentText 	23 	OpenDocument Text format.
wdFormatHTML 	8 	Standard HTML format.
wdFormatRTF 	6 	Rich text format (RTF).
wdFormatStrictOpenXMLDocument 	24 	Strict Open XML document format.
wdFormatTemplate 	1 	Word template format.
wdFormatText 	2 	Microsoft Windows text format.
wdFormatTextLineBreaks 	3 	Windows text format with line breaks preserved.
wdFormatUnicodeText 	7 	Unicode text format.
wdFormatWebArchive 	9 	Web archive format.
wdFormatXML 	11 	Extensible Markup Language (XML) format.
wdFormatDocument97 	0 	Microsoft Word 97 document format.
wdFormatDocumentDefault 	16 	Word default document file format. For Word, this is the DOCX format.
wdFormatPDF 	17 	PDF format.
wdFormatTemplate97 	1 	Word 97 template format.
wdFormatXMLDocument 	12 	XML document format.
wdFormatXMLDocumentMacroEnabled 	13 	XML document format with macros enabled.
wdFormatXMLTemplate 	14 	XML template format.
wdFormatXMLTemplateMacroEnabled 	15 	XML template format with macros enabled.
wdFormatXPS 	18 	XPS format.
1 Like

Make sure it’s UTF-8, which is Processing’s text encoding.

You can check if your text file is UTF-8 by opening it using a simpler text editor, like this 1:

2 Likes

Thanks, but utf 8 doesn’t appear in the list above.

Which format would be utf-8?

We can check current text encoding at the bottom part of Notepad3:

We can double-click it to change to some other encoding.

We can also double-click to swap LF to CR+LF or to change current syntax highlighting.

1 Like

I found a Powershell Skript (Win 10) to automatically convert all the txt files to utf8 in one go.

Thanks to all!