Questions tagged [utf-8]
For questions about UTF-8 (UCS Transformation Format, 8-bit). It is a variable-width character encoding that can represent every code point in the Unicode character set.
275
questions
131
votes
2
answers
29k
views
Is it possible to create a QR code using text?
QR codes I have seen are mostly image files. But can you create QR codes using plain text?
For example ASCII and UTF-8 have black boxes as characters. Can I use those together with spaces to create a ...
68
votes
2
answers
258k
views
Setting UTF8 as default Character Encoding in Windows 7
Is there a way to set Windows 7 to globally use UTF-8 as the default encoding?
It's really annoying to set every single text editor to use it.
54
votes
5
answers
197k
views
Notepad++ inserting special Unicode characters in UTF-8
What's the best way to enter special Unicode characters into a Notepad++ document? Do I have to rely on the operating system (Windows)?
Looking for a see-and-click solution.
I can bring up the ASCII ...
50
votes
5
answers
42k
views
How can I change the default encoding type Thunderbird uses when composing a new email?
Whenever I compose a new email message in Thunderbird, it defaults to the Western (ISO-8859-1) encoding, and I have to manually change it to UTF-8 every time.
How do I change the default?
49
votes
1
answer
137k
views
How can I convert multiple files to UTF-8 encoding using *nix command line tools? [duplicate]
Possible Duplicate:
Batch-convert files for encoding or line ending
I have a bunch of text files that I'd like to convert from any given charset to UTF-8 encoding.
Are there any command line ...
44
votes
8
answers
324k
views
Excel: Change default encoding (file origin) of Text Import Wizard to UTF-8 (65001 : Unicode)
I am using a variety of tools to regularly prepare data for the web. One stage requires me to open a CSV in Excel, make changes and save the file.
Is there a way to force Excel to accept UTF-8 ...
35
votes
6
answers
174k
views
Converting text file to UTF-8 on Windows command prompt
I need to convert a text file to UTF-8 format via Windows command prompt. This needs to be done on another machine and I do not have rights to install software on that machine. I need something like:
...
35
votes
1
answer
150k
views
ANSI to UTF-8 in Notepad++
I have a text encoded in ANSI:
When I tried to convert it into UTF-8 (using the Notepad++ menu Encoding > UTF-8), I get some weird characters:
I thought that UTF-8 was a superset of ANSI and that I ...
31
votes
5
answers
73k
views
Is there a Linux command to find out if a file is UTF-8?
The Joomla .ini files must be saved as UTF-8.
After editing I'm not sure if the files are UTF-8 or not.
Is there a Linux command like file or a few commands that would tell if a file is ...
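One way to approach this kind of check is to try a strict UTF-8 decode of the file's bytes. The following is a minimal sketch in Python, not an answer taken from the thread; the function names `is_valid_utf8` and `file_is_utf8` are illustrative.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode as strict UTF-8."""
    try:
        data.decode("utf-8")  # error handling is "strict" by default
    except UnicodeDecodeError:
        return False
    return True


def file_is_utf8(path: str) -> bool:
    """Read the whole file as bytes and test it; suitable for small files."""
    with open(path, "rb") as handle:
        return is_valid_utf8(handle.read())
```

Pure ASCII files also pass this test, since ASCII is a subset of UTF-8. A passing result only shows that the bytes are well-formed UTF-8, not that the author intended that encoding.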
27
votes
8
answers
93k
views
Extract large zip file (50 GB) on Mac OS X
I was trying to move the files to another hard drive. So I archived all my photos in one large ZIP file using the Mac OS X built-in compress function.
But the file failed to extract. I've tried ...
24
votes
2
answers
122k
views
Changing the default ANSI to UTF-8 in Notepad
I got an e-book reader a few months ago, and I finally figured out why it doesn't display .txt files correctly. It doesn't like ANSI. It has no problem with the other 3 types of coding notepad ...
22
votes
1
answer
22k
views
Differences between en_US.utf8 and en_US.UTF-8?
I've had a terrible time getting zsh to play nicely with Debian Jessie, and I've come to the conclusion that my issues all stem from my system's locale. Running locale, I see
LANG=en_US.utf8
LANGUAGE=...
21
votes
3
answers
18k
views
Program to check/look up UTF-8/Unicode characters in string on command line?
I've just realized I have a file on my system; it lists normally:
$ ls -la TΕSТER.txt
-rw-r--r-- 1 user user 8 2013-04-11 18:07 TΕSТER.txt
$ cat TΕSТER.txt
testing
... yet, it crashes a piece of ...
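Look-alike characters such as these can be told apart by printing each character's code point and Unicode name. A minimal sketch in Python, assuming the name is available as a string; `describe_chars` and the sample string are hypothetical and not taken from the question.

```python
import unicodedata


def describe_chars(text):
    """Return (character, code point, Unicode name) for each character."""
    rows = []
    for ch in text:
        # Passing a default avoids the ValueError that unicodedata.name
        # raises for characters without a name.
        name = unicodedata.name(ch, "<unnamed>")
        rows.append((ch, f"U+{ord(ch):04X}", name))
    return rows


if __name__ == "__main__":
    # Hypothetical sample: a Latin "T" followed by a Greek capital epsilon.
    for ch, code_point, name in describe_chars("T\u0395"):
        print(code_point, name)
```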
21
votes
2
answers
63k
views
How can I force Notepad++ to use UTF-8 every time I create a new file?
I want to use the character encoding UTF-8 (without BOM) on my files. I can change the encoding from the "Format" menu, but it is reset every time I create a new file or open a file.
How can I do so ...
20
votes
2
answers
24k
views
What's the code page of UTF-8?
My command prompt's default code page is 936.
I need to change it to UTF-8.
chcp 65001
The above doesn't work. What's the correct one?
19
votes
6
answers
35k
views
Unicode, Unicode Big Endian or UTF-8? What is the difference? Which format is better?
When I try to save a text file with non-English text in Notepad, I get an option to choose between Unicode, Unicode Big Endian and UTF-8. What is the difference between these formats?
Assuming I do ...
19
votes
1
answer
2k
views
Hyphen is not rendering in Notepad++
I'm sure this is an encoding issue, but I can't figure it out.
I exported a spreadsheet from Excel as a UTF-8 CSV. This produced a CSV in the UTF-8-BOM character encoding. Opening this file in Notepad+...
15
votes
2
answers
74k
views
Which languages does the character encoding UTF-8 support?
What written and spoken languages does UTF-8 support?
How many languages does UTF-8 support?
14
votes
3
answers
63k
views
Windows 7 UTF-8 and Unicode
Could someone please explain what has changed in Windows 7 (Pro 64-bit)?
Details:
Previously I had Windows XP and had some translations files (UTF-8 encoded) in CSV format. I was able to view the ...
14
votes
2
answers
43k
views
What encoding to get Å Ä Ö to work
I'm writing a small application in VB.NET that will enable me to
easily create a user "Windows 7 account" with a password, instead of
going through the control panel.
The problem I'm having ...
13
votes
4
answers
20k
views
How can I get zsh to display international characters properly?
I just started using zsh, and love it. However, I've stumbled upon an annoyance when it comes to international characters:
➜ ~ touch åäö.txt
➜ ~ ls
...
13
votes
4
answers
12k
views
Fix encoding of German umlauts in directories and filenames (ü = u╠ê and so on)
I have many zip-files where there are encoding errors for the German umlauts (äüöÄÜÖß). They show up in both the filename.zip as well as in the included directories and files like this:
Fünf = Fu╠ênf
...
12
votes
4
answers
33k
views
How do I get Excel to not mess around with the formatting of a .csv file
I have two issues with a csv file that occur when I try and edit it in Excel.
I have an ISBN column that gets reformatted to scientific format every time I save the file
The encoding for the file is ...
11
votes
7
answers
12k
views
Why does Unicode have big or little endian but UTF-8 doesn't?
UNICODE uses 2 bytes for one character, so it has big or little endian difference. For example, the character 哈 is 54 C8 in hex.
And its UTF-8 therefore is:
11100101 10010011 10001000
UTF-8 uses 3 ...
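The byte values quoted in this excerpt can be reproduced with a short Python sketch. The helper names `hex_bytes` and `bit_bytes` are illustrative only.

```python
def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated two-digit hex values."""
    return " ".join(f"{b:02X}" for b in data)


def bit_bytes(data: bytes) -> str:
    """Format bytes as space-separated 8-bit binary strings."""
    return " ".join(f"{b:08b}" for b in data)


ch = "\u54c8"  # the character 哈 from the question

print(hex_bytes(ch.encode("utf-16-be")))  # 54 C8
print(hex_bytes(ch.encode("utf-16-le")))  # C8 54
print(hex_bytes(ch.encode("utf-8")))      # E5 93 88
print(bit_bytes(ch.encode("utf-8")))      # 11100101 10010011 10001000
```

The two UTF-16 results differ only in the order of the bytes within the 16-bit unit. UTF-8 is defined as a sequence of single bytes, so there is no unit whose bytes could be swapped.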
10
votes
2
answers
17k
views
How do I convert UTF-8 special characters in Bash?
I am writing a script that extracts and saves JPEG attachments from emails and passes them to ImageMagick. However, I am living in Germany, and special characters in the email text/subject such as "ö", "ä", ...
10
votes
2
answers
28k
views
How to find out the character set of a text file
Is there a tool (Mac or online) that can help me find out the character set of a text file and then convert it to UTF-8?
10
votes
5
answers
40k
views
UTF-8 Angle brackets up and down?
I'm looking for angle brackets which point up and down as UTF-8 characters.
Just for clarification: I need two characters, an angle bracket pointing up and one pointing down.
10
votes
3
answers
12k
views
Can I set up KiTTY to send F7 press instead of bringing up Print dialogue when F7 button is pressed?
Can I set up KiTTY to send F7 press instead of bringing up Print dialogue when I press F7?
I make intense use of Midnight Commander and its editor, which both rely on F7 (to create a directory and to ...
10
votes
4
answers
27k
views
On Windows 7, dir or tree can't show unicode characters, even starting cmd with cmd /U
On Windows 7, dir or tree can't show unicode characters, even starting cmd with cmd /U
So I would press Windows key + R to run something, and type in cmd /U so that the content might handle Unicode.
...
10
votes
2
answers
6k
views
Trying to understand UTF-8 locale portability (and SSH) on various Linux systems including OS X
I spend a lot of my time sshed into various machines, all of which are different (some are embedded, some run Linux, some run BSD, &c.). On my own local machines, however, I use OS X, which of ...
9
votes
2
answers
3k
views
Why does VIM show the Unicode code point and not the UTF-8 code value?
Consider this supposed line of code that I found in a PHP blog, note the quotes:
throw new Exception(“That's not a server name!”);
Those quotes are RIGHT DOUBLE QUOTATION MARK (Unicode code point: U+...
9
votes
4
answers
14k
views
How to find all files in directory that contain UTF-8 BOM (byte-order mark)?
On Windows, I need to find all files in a directory that contain UTF-8 BOM (byte-order mark). Which tool can do that and how?
It can be a PowerShell script, some text editor's advanced search feature ...
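As an illustration of what such a search involves, the sketch below walks a directory tree and reports files whose first three bytes are the UTF-8 BOM (EF BB BF). It is written in Python rather than PowerShell, and it assumes "contain" means "start with". The function name `files_with_utf8_bom` is hypothetical.

```python
import codecs
from pathlib import Path


def files_with_utf8_bom(directory):
    """Yield paths of regular files under directory that start with EF BB BF."""
    for path in sorted(Path(directory).rglob("*")):
        if path.is_file():
            with path.open("rb") as handle:
                # Only the first three bytes are read, so large files are cheap.
                if handle.read(3) == codecs.BOM_UTF8:
                    yield path
```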
9
votes
2
answers
26k
views
Log in to FTP using Windows Explorer with @ in the username
I need to set up a connection to an FTP server, but there are a couple of caveats. I CANNOT use third-party software (period). It has to be done in Windows Explorer.
The other caveat is that there ...
9
votes
2
answers
2k
views
In UTF-8 collation, why is 11- less than 1-?
I found that the sort result in ASCII:
Source file test:
1-
11-
1-a
11-a
Sort using ASCII:
$ LANG=en_US.ascii sort test
1-
1-a
11-
11-a
And using UTF-8:
$ LANG=en_US.utf8 sort test
1-
11-
11-...
9
votes
1
answer
76k
views
How to change LibreOffice default text encoding?
I want to change the default text encoding used by LibreOffice when saving a document as a Text document. Where can I find this setting?
I want it to be UTF-8 WITHOUT the BOM, which I believe is ...
9
votes
2
answers
3k
views
How to go to the n'th character, not byte, of a file?
In vim one can get to the 5th byte of the file with the following command:
:goto 5
However, in a UTF-8 text this could be the 5th, 4th, or even 2nd character in the file. How to go to the 5th ...
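The gap between byte positions and character positions can be made concrete with a small Python sketch. It maps a 1-based character index to a 0-based byte offset; the function name `byte_offset_of_char` is illustrative, and "character" here means a Unicode code point, not a user-perceived grapheme.

```python
def byte_offset_of_char(data: bytes, n: int) -> int:
    """Return the 0-based byte offset of the n-th (1-based) character in UTF-8 data."""
    text = data.decode("utf-8")
    if not 1 <= n <= len(text):
        raise IndexError("character index out of range")
    # The bytes used by the first n-1 characters give the offset of the n-th.
    return len(text[: n - 1].encode("utf-8"))
```

For a tool that counts bytes starting from one, add 1 to the returned offset.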
8
votes
4
answers
21k
views
File encoding not preserved after saving in Notepad++
I'm using Notepad++ for editing my PHP scripts.
However, I found a strange problem: when changing the encoding from ANSI to UTF-8 (without BOM), saving, closing, re-loading – then checking encoding: ...
8
votes
3
answers
72k
views
Batch change encoding ascii files from utf-8 to iso-8859-1 [duplicate]
Possible Duplicate:
Batch-convert files for encoding or line ending under Windows
I need a tool like this
http://www.rotatingscrew.com/utfcast.aspx
But the tool should do the opposite, convert ...
8
votes
1
answer
14k
views
How can I insert a character code into a Linux text editor?
I need a lightweight GUI text editor on my Ubuntu Lucid system which lets me specify a Unicode code point (e.g. U+1234), and inserts that character to a UTF-8 text document. I know that gedit can't do ...
8
votes
4
answers
6k
views
Problem with SVN filename encoding on Mac OS X
I have some filenames with Unicode characters in them. All filenames on Mac OS X are UTF-8 encoded. Also $LANG is set to en_US.UTF-8.
However, it seems svn has some problems with that:
az@ip212 1054 ...
8
votes
3
answers
2k
views
How to set Notepad2 to use UTF-8 instead of ANSI
Each time I create a new file in Notepad2, the encoding is ANSI (1252).
I'd like to have UTF-8 as the default instead. Is there a way to do this?
8
votes
5
answers
60k
views
Change the default encoding in Notepad
I wanted to change Notepad's default encoding from ANSI to UTF-8. I followed the instructions at the following link, and that worked.
But one problem remains. If I create a text file, the default ...
7
votes
3
answers
9k
views
gVim utf-8 in windows
When I open gVim it's localized to Swedish, which is fine. But warning/error messages that have Swedish characters in them don't display correctly. When trying to delete a buffer with unsaved changes ...
7
votes
2
answers
9k
views
Why does Notepad++ display some Unicode characters as square?
It can display ₐ₊₋ but some other subscripts (like ₂ₙ) are converted to squares. When I copy that square and paste on here the problem disappears: ₂. Why is that? The file is encoded in UTF-8.
FYI: ...
7
votes
1
answer
2k
views
What causes 3 special characters after conversion to UTF-8?
One of our work steps involves saving an MS Excel worksheet as CSV and then using UltraEdit to convert the CSV to UTF-8 before importing it into a server system.
The problem is that, after the ...
7
votes
3
answers
8k
views
Which (default) fonts contain the Windows/Apple logo?
I'm looking for fonts that are installed by default on Windows or on Apple systems and that contain either the Windows or the Apple logo.
According to this article it should be character
U+F8FF ...
7
votes
4
answers
11k
views
Convert UTF-16 LE to UTF-8 in windows via command line
(question re-written to be more useful)
I have a batch script which will interact with command line programs, take their output, and then perform decisions based on that output.
One of the programs I ...
7
votes
1
answer
9k
views
Getting "Warning: Missing charsets in String to FontSet conversion"
I'm using X applications over an SSH connection to some machine. When I run an X app, say, xclock for simplicity, I get a console message saying:
Warning: Missing charsets in String to FontSet ...
7
votes
1
answer
17k
views
Force UTF-8 on every document opened in Notepad++
I want every document that I create or open in Notepad++ to always (no matter what) be encoded in UTF-8. It seems that either I'm missing something or this is not possible.
In Settings > ...
7
votes
1
answer
306
views
Is there a way to force all file transactions with a filesystem to be UTF8 or UTF16 compliant?
What I want is to specify that for a directory, every file creation/modification within said directory will be checked by the kernel and if the filename has unsupported characters the offending ...