I need to import data from a Microsoft SQL Server View into STATA. I do that using odbc. That works also great. Unfortunately, I have two variables with an encoding Problem. Both Variables contain strings that have Umlaute in the german language (ä, ö, ü, ß). The weird thing is, that there are other variables containing Umlaute which are shown correctly. I used filefilter to change the encoding to utf-8, which corrected the two variables. But now all the others are not shown correctly anymore. Unicode analyze gave me the following output:
unicode analyze students_data1.txt
File summary (before starting):
1 file(s) specified
1 file(s) to be examined ...
File students_data1.txt (text file)
60496 lines in file
2069 lines ASCII
437 lines UTF-8
57990 lines need translation
I also tried unicode translate
, which ended with the same result as filefilter
. Is there a way, how I can only change the encoding of the two variables, without changing the entire dataset?