Skip to main content

Questions tagged [utf-8]

For discussions about UCS Transformation Format with 8 bits (UTF8). It is an encoding table that represents all characters in the Unicode character set.

Filter by
Sorted by
Tagged with
131 votes
2 answers
29k views

Is it possible to create a QR code using text?

QR codes I have seen are mostly image files. But can you create QR codes using plain text? For example ASCII and UTF-8 have black boxes as characters. Can I use those together with spaces to create a ...
MCK's user avatar
  • 1,175
68 votes
2 answers
258k views

Setting UTF8 as default Character Encoding in Windows 7

is there a way to set Windows 7 to globally use UTF-8 as standard? its really annoying to set every single text editor to use it.
Baarn's user avatar
  • 6,664
54 votes
5 answers
197k views

Notepad++ inserting special Unicode characters in UTF-8

What's the best ways to enter special Unicode characters into a Notepad++ document? Do I have to rely on the operating system (Windows)? Looking for a see-and-click solution. I can bring up the ASCII ...
Bob Stein's user avatar
  • 1,421
50 votes
5 answers
42k views

How can I change the default encoding type Thunderbird uses when composing a new email?

Whenever I compose a new email message in Thunderbird, it defaults to the Western (ISO-8859-1) encoding, and I have to manually change it to UTF-8 every time. How do I change the default?
hpy's user avatar
  • 6,121
49 votes
1 answer
137k views

How can I convert multiple files to UTF-8 encoding using *nix command line tools? [duplicate]

Possible Duplicate: Batch-convert files for encoding or line ending I have a bunch of text files that I'd like to convert from any given charset to UTF-8 encoding. Are there any command line ...
jason's user avatar
  • 655
44 votes
8 answers
324k views

Excel: Change default encoding (file origin) of Text Import Wizard to UTF-8 (65001 : Unicode)

I am using a variety of tools to regularly prepare data for the web. One stage requires me to open a CSV in Excel, make changes and save the file. Is there a way to force Excel to accept UTF-8 ...
Dizzley's user avatar
  • 1,041
35 votes
6 answers
174k views

Converting text file to UTF-8 on Windows command prompt

I need to convert a text file to UTF-8 format via Windows command prompt. This needs to be done on another machine and I do not have rights to install software on that machine. I need something like: ...
user1107888's user avatar
35 votes
1 answer
150k views

ANSI to UTF-8 in Notepad++

I have a text encoded in ANSI: When I tried to convert it into UTF-8 (using the Notepad++ menu Encoding > UTF-8), I get some weird characters: I thought that UTF-8 was a superset of ANSI and that I ...
user3658425's user avatar
31 votes
5 answers
73k views

Is there a Linux command to find out if a file is UTF-8?

The Joomla .ini files require to be saved as UTF-8. After editing I'm not sure if the files are UTF-8 or not. Is there a Linux command like file or a few commands that would tell if a file is ...
Edward's user avatar
  • 509
27 votes
8 answers
93k views

Extract large zip file (50 GB) on Mac OS X

I was trying to move the files to another hard drive. So I archived all my photos in one large ZIP file using the Mac OS X built-in compress function. But the file failed to extract. I've tried ...
user avatar
24 votes
2 answers
122k views

Changing the default ANSI to UTF-8 in Notepad

I got an e-book reader a few months ago, and I finally figured out why it doesn't display .txt files correctly. It doesn't like ANSI. It has no problem with the other 3 types of coding notepad ...
Emma's user avatar
  • 241
22 votes
1 answer
22k views

Differences between en_US.utf8 and en_US.UTF-8?

I've had a terrible time getting zsh to play nicely with Debian Jessie, and I've come to the conclusion that my issues all stem from my system's locale. Running locale, I see LANG=en_US.utf8 LANGUAGE=...
rayhem's user avatar
  • 445
21 votes
3 answers
18k views

Program to check/look up UTF-8/Unicode characters in string on command line?

I've just realized I have a file on my system; it lists normally: $ ls -la TΕSТER.txt -rw-r--r-- 1 user user 8 2013-04-11 18:07 TΕSТER.txt $ cat TΕSТER.txt testing ... yet, it crashes a piece of ...
sdaau's user avatar
  • 5,718
21 votes
2 answers
63k views

How can I enforce so Notepad++ uses UTF-8 every time I create a new file?

I want to use the character encoding UTF-8 (without BOM) on my files. I can change the encoding from the "Format" menu, but it is reset every time I create a new file or open a file. How can I do so ...
Jonas's user avatar
  • 27.8k
20 votes
2 answers
24k views

What's the code page of UTF-8?

My command prompt's default code page is 936. I need to change it to UTF-8. chcp 65001 The above doesn't work. What's the correct one?
user avatar
19 votes
6 answers
35k views

Unicode, Unicode Big Endian or UTF-8? What is the difference? Which format is better?

When I try to save a text file with non-English text in Notepad, I get an option to choose between Unicode, Unicode Big Endian and UTF-8. What is the difference between these formats? Assuming I do ...
Ashwin Nanjappa's user avatar
19 votes
1 answer
2k views

Hyphen is not rendering in Notepad++

I'm sure this is an encoding issue, but I can't figure it out. I exported a spreadsheet from Excel as a UTF-8 CSV. This produced a CSV in the UTF-8-BOM character encoding. Opening this file in Notepad+...
Joaquim d'Souza's user avatar
15 votes
2 answers
74k views

Which languages does the character encoding UTF-8 support?

What written and speak-able languages does UTF-8 support? How many languages does the UTF-8 support?
William's user avatar
  • 1,398
14 votes
3 answers
63k views

Windows 7 UTF-8 and Unicode

Could someone please explain what has changed in Windows 7 (Pro 64-bit)? Details: Previously I had Windows XP and had some translations files (UTF-8 encoded) in CSV format. I was able to view the ...
Sha Le's user avatar
  • 141
14 votes
2 answers
43k views

What encoding to get Å Ä Ö to work

I'm writing a small application in VB.NET that will enable me to easily create a user "Windows 7 account" with a password, instead of going through the control panel. The problem I'm having ...
kagstrom2100's user avatar
13 votes
4 answers
20k views

How can I get zsh to display international characters properly?

I just started using zsh, and love it. However, I've stumbled upon an annoyance when it comes to international characters: ➜ ~ touch åäö.txt ➜ ~ ls ...
Morgan's user avatar
  • 821
13 votes
4 answers
12k views

Fix encoding of German umlauts in directories and filenames (ü = u╠ê and so on)

I have many zip-files where there are encoding errors for the German umlauts (äüöÄÜÖß). They show up in both the filename.zip as well as in the included directories and files like this: Fünf = Fu╠ênf ...
cider's user avatar
  • 131
12 votes
4 answers
33k views

How do I get excel to not mess around with the formatting of a .csv file

I have two issues with a csv file that occur when I try and edit it in Excel. I have an ISBN column that gets reformatted to scientific format every time I save the file The encoding for the file is ...
Martin Duys's user avatar
  • 1,191
11 votes
7 answers
12k views

Why does Unicode have big or little endian but UTF-8 doesn't?

UNICODE uses 2 bytes for one character, so it has big or little endian difference. For example, the character 哈 is 54 C8 in hex. And its UTF-8 therefore is: 11100101 10010011 10001000 UTF-8 uses 3 ...
Tiina's user avatar
  • 3,207
10 votes
2 answers
17k views

How do I convert UTF-8 special characters in Bash?

I am writing on a script that extracts and saves JPEG-attachements from emails and passes them to imagemagick. However, I am living in Germany and special characters in email text/subject as "ö", "ä", ...
Markus's user avatar
  • 269
10 votes
2 answers
28k views

How to find out the character set of a text file

Is the a tool (mac or online) that can help me to find out the character set of a text file, and then convert it to utf-8 ?
ragnarius's user avatar
  • 381
10 votes
5 answers
40k views

UTF-8 Angle brackets up and down?

I'm looking for angle brackets which point up and down as UTF-8 characters. Just for clarification: I need two characters, an angle bracket pointing up and one pointing down.
Hedge's user avatar
  • 1,164
10 votes
3 answers
12k views

Can I set up KiTTY to send F7 press instead of bringing up Print dialogue when F7 button is pressed?

Can I set up KiTTY to send F7 press instead of bringing up Print dialogue when I press F7? I make intense use of Midnight Commander and its editor, which both rely on F7 (to create a directory and to ...
Ivan's user avatar
  • 7,633
10 votes
4 answers
27k views

On Windows 7, dir or tree can't show unicode characters, even starting cmd with cmd /U

On Windows 7, dir or tree can't show unicode characters, even starting cmd with cmd /U So I would press Window Key + R to run something, and type in cmd /U so that the content might handle Unicode. ...
nonopolarity's user avatar
  • 9,696
10 votes
2 answers
6k views

Trying to understand UTF-8 locale portability (and SSH) on various Linux systems including OS X

I spend a lot of my time sshed into various machines, all of which are different (some are embedded, some run Linux, some run BSD, &c.). On my own local machines, However, I use OS X, which of ...
kine's user avatar
  • 1,829
9 votes
2 answers
3k views

Why does VIM show the Unicode code point and not the UTF-8 code value?

Consider this supposed line of code that I found in a PHP blog, note the quotes: throw new Exception(“That's not a server name!”); Those quotes are RIGHT DOUBLE QUOTATION MARK (Unicode code point: U+...
dotancohen's user avatar
  • 11.5k
9 votes
4 answers
14k views

How to find all files in directory that contain UTF-8 BOM (byte-order mark)?

On Windows, I need to find all files in a directory that contain UTF-8 BOM (byte-order mark). Which tool can do that and how? It can be a PowerShell script, some text editor's advanced search feature ...
Borek Bernard's user avatar
9 votes
2 answers
26k views

Log in to FTP using Windows Explorer with @ in the username

I need to set up a connection to an FTP server, but there are a couple of caveats. I CANNOT use third-party software (period). It has to be done in Windows Explorer. The other caveat is that there ...
Karl Is Wright's user avatar
9 votes
2 answers
2k views

In utf-8 collation, why 11- is less then 1-?

I found that the sort result in ASCII: Source file test: 1- 11- 1-a 11-a Sort using ASCII: $ LANG=en_US.ascii sort test 1- 1-a 11- 11-a And using UTF-8: $ LANG=en_US.utf8 sort test 1- 11- 11-...
Lenik's user avatar
  • 18.4k
9 votes
1 answer
76k views

How to change LibreOffice default text encoding?

I want to change the default text encoding used by LibreOffice when saving a document as a Text document. Where can I find this setting? I want it to be UTF-8 WITHOUT the BOM, which I believe is ...
Bakuriu's user avatar
  • 197
9 votes
2 answers
3k views

How to go to the n'th character, not byte, of a file?

In vim one can get to the 5th byte of the file with the following command: :goto 5 However, in a UTF-8 text this could be the 5th, 4th, or even 2nd character in the file. How to go to the 5th ...
dotancohen's user avatar
  • 11.5k
8 votes
4 answers
21k views

File encoding not preserved after saving in Notepad++

I'm using Notepad++ for editing my PHP scripts. However, I found a strange problem: when changing the encoding from ANSI to UTF-8 (without BOM), saving, closing, re-loading – then checking encoding: ...
Fuxi's user avatar
  • 793
8 votes
3 answers
72k views

Batch change encoding ascii files from utf-8 to iso-8859-1 [duplicate]

Possible Duplicate: Batch-convert files for encoding or line ending under Windows I need a tool like this http://www.rotatingscrew.com/utfcast.aspx But the tool should do the opposite, convert ...
Enrique's user avatar
  • 305
8 votes
1 answer
14k views

How can I insert a character code into a Linux text editor?

I need a lightweight GUI text editor on my Ubuntu Lucid system which lets me specify a Unicode code point (e.g. U+1234), and inserts that character to a UTF-8 text document. I know that gedit can't do ...
pts's user avatar
  • 7,380
8 votes
4 answers
6k views

Problem with SVN filename encoding on Mac OS X

I have some filename with some Unicode character in it. All filenames on Mac OS X are UTF8 encoded. Also $LANG is set to en_US.UTF-8. However, it seems svn has some problems with that: az@ip212 1054 ...
Albert's user avatar
  • 6,681
8 votes
3 answers
2k views

How to set Notepad2 to use UTF-8 instead of ANSI

Each time I create a new file in Notepad2, the encoding is ANSI (1252). I'd like instead to have UTF-8 as default. Is there a way to get this ?
user avatar
8 votes
5 answers
60k views

Change the default of encoding in Notepad

I wanted to change the default of encoding in Notepad (ANSI) to UTF-8. According to following link went ahead and this work was done. But still remains a problem. If create a text file, the default ...
WeLoveLinux's user avatar
7 votes
3 answers
9k views

gVim utf-8 in windows

When I open gVim it's localized to Sweidish, which is fine. But warning/error messages that have Swedish characters in them don't display correctly. When trying to delete a buffer with unsaved changes ...
Tobbe's user avatar
  • 369
7 votes
2 answers
9k views

Why does Notepad++ display some Unicode characters as square?

It can display ₐ₊₋ but some other subscripts (like ₂ₙ) are converted to squares. When I copy that square and paste on here the problem disappears: ₂. Why is that? The file is encoded in UTF-8. FYI: ...
Ooker's user avatar
  • 2,131
7 votes
1 answer
2k views

What causes 3 special characters after conversion to UTF-8?

One of our work steps involves saving an MS Excel worksheet as CSV and then using UltraEdit to convert the CSV to UTF-8 before importing it into a server system. The problem is that, after the ...
Torben Gundtofte-Bruun's user avatar
7 votes
3 answers
8k views

Which (default) fonts contain the Windows/Apple logo?

I'm looking for fonts which are installed on Windows respectively Apple by default which contain either the Windows or Apple-logo . According to this article it should be character U+F8FF ...
Hedge's user avatar
  • 1,164
7 votes
4 answers
11k views

Convert UTF-16 LE to UTF-8 in windows via command line

(question re-written to be more useful) I have a batch script which will interact with command line programs, take their output, and then perform decisions based on that output. One of the programs I ...
bfh47's user avatar
  • 103
7 votes
1 answer
9k views

Getting "Warning: Missing charsets in String to FontSet conversion"

I'm using X applications over an SSH connection to some machine. When I run an X app, say, xclock for simplicity, I get a console message saying: Warning: Missing charsets in String to FontSet ...
einpoklum's user avatar
  • 9,751
7 votes
1 answer
17k views

Force UTF-8 on every document opened in Notepad++

I want every document that I create or open in Notepad++ to always (no matter what) be encoded in UTF-8. It seems, that either I'm missing something or this is not possible. In Settings > ...
trejder's user avatar
  • 11.4k
7 votes
1 answer
306 views

Is there a way to force all file transactions with a filesystem to be UTF8 or UTF16 compliant?

What I want is to specify that for a directory, every file creation/modification within said directory will be checked by the kernel and if the filename has unsupported characters the offending ...
life of pi's user avatar

1
2 3 4 5 6