By default, Bugzilla does not search the list of RESOLVED bugs.
You can force it to do so by putting the upper-case word ALL in front of your search query, e.g.: ALL tdelibs
We recommend searching for bugs this way, as you may discover that your bug has already been resolved and fixed in a later release.
Bug 2828 - various unicode characters don't show in ubuntu
Summary: various unicode characters don't show in ubuntu
Status: NEW
Alias: None
Product: TDE
Classification: Unclassified
Component: tdebase (show other bugs)
Version: R14.0.x [Trinity]
Hardware: All Linux
: P5 normal
Assignee: Timothy Pearson
URL:
Depends on:
Blocks:
 
Reported: 2017-09-04 14:53 CDT by wofgdkncxojef
Modified: 2018-08-30 03:01 CDT (History)
4 users (show)

See Also:
Compiler Version:
TDE Version String:
Application Version:
Application Name:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description wofgdkncxojef 2017-09-04 14:53:37 CDT
various unicode charcaters don't show in trinity on ubuntu, example 
Comment 1 deloptes 2017-09-04 22:54:25 CDT
> in here "Miscellaneous Technical"
>
https://en.wikipedia.org/wiki/List_of_Unicode_characters#Miscellaneous_Technical
> firefox shows almost all of them
> konqueror shows almost none.
> 

Indeed I do not see the same as in firefox. [...]

IMO it is a combination of font and rendering. Can you try with kcharselect?

I can see the symbols with 

Font: Liberation Mono   (or most mono type fonts)
Table: 35
Unicode: 2300

For example the math symbols are visible with
Font: Latin Modern Sans Quotation       (or most sans type fonts)       
Table: 34
Unicode: 2200

Check if you have all needed fonts installed. When I was using LaTeX I
installed many additional fonts. I replaced the PC with newly installed
since, so might be they are missing here again.

Check also the default fonts setup in TDE.
Comment 2 wofgdkncxojef 2017-09-04 23:29:37 CDT
-_- ....
crap, the site ate all my post after the character "grapes" U+1F347 and didn't noticed....
i had "grapes" at the end of the title too....

I'm on linux mint 18.2 mate 64bit
i can see the grapes in xed, libreoffice, firefox, vlc, kiten and synaptic. 
But in none of trinity apps.....

$ echo "$LANG"
en_US.UTF-8

with kcharselect
what your saying seams correct.
configuration seams correct....
Comment 3 deloptes 2017-09-05 00:16:06 CDT
OK, so we can conclude that konqueror does not pick up the correct font to display the characters. It seems like a valid hypothesis.

The statement
"i can see the grapes in xed, libreoffice, firefox, vlc, kiten and synaptic. 
But in none of trinity apps....."

seems contradictory - do you see this in kcharselect?

please confirm
Comment 4 deloptes 2017-09-05 01:03:09 CDT
besides this kcharselect can only handle 4digit code and what you stated (U+1F347) is 5 digit. I don't know how it can be displayed in kcharselect.

Most likely there needs to be some update to handle this.
Comment 5 wofgdkncxojef 2017-09-05 01:26:46 CDT
yea kcharselect stops at FFFF ....

in control center
standard fonts: Sans serif
default encoding: "use language encoding"

i can see the grapes and other stuff in the apps i'm mentioning, but not in trinity apps (kiten has a qt4 version)

Not in:
konsole, kate, kwrite, konqueror, kmail, klipper......

i also tried trinity apps from inside mate. same result.
Comment 6 deloptes 2017-09-05 02:25:36 CDT
I think we narrowed down the problem. Someone needs to find time to look into the code and decide what needs to be updated.

thanks so far and regards
Comment 7 Timothy Pearson 2017-09-05 21:52:15 CDT
I've noticed something like this for a while but thought it was just my machine.  In my case I can't see symbols like TM (™) in TDE applications.

Can you confirm that you are unable to see the "™" symbol in TDE as well?

Thanks!
Comment 8 wofgdkncxojef 2017-09-05 22:45:41 CDT
I can actually see "™"
tried konsole, kmail, akregator, kate.....
Comment 9 deloptes 2017-09-06 00:54:22 CDT
(™) is U+2122 and is visible in kcharselect, so it should be visible in the tde apps (assuming you have the proper font)

Font: Sans Serif       (or most sans type fonts)       
Table: 33
Unicode: 2122

It is also visible in konqueror - try https://en.wikipedia.org/wiki/List_of_Unicode_characters#Letterlike_Symbols

However there is significant difference between konqueror and kcharselect. In konqueror you see a fraction of what you see in kcharselect
Comment 10 deloptes 2018-01-14 05:55:57 CST
man tqchar

So to fix this we should fix tqchar to handle size bigger than U+0000

       The QChar class provides a lightweight Unicode character.

       Unicode characters are (so far) 16-bit entities without any markup or structure. This class represents such an entity. It is lightweight, so it
       can be used everywhere. Most compilers treat it like a "short int". (In a few years it may be necessary to make QChar 32-bit when more than
       65536 Unicode code points have been defined and come into use.)

       QChar provides a full complement of testing/classification functions, converting to and from other formats, converting from composed to
       decomposed Unicode, and trying to compare and case-convert if you ask it to.

       The classification functions include functions like those in ctype.h, but operating on the full range of Unicode characters. They all return
       TRUE if the character is a certain type of character; otherwise they return FALSE. These classification functions are isNull() (returns TRUE if
       the character is U+0000), isPrint() (TRUE if the character is any sort of printable character, including whitespace), isPunct() (any sort of
       punctation), isMark() (Unicode Mark), isLetter (a letter), isNumber() (any sort of numeric character), isLetterOrNumber(), and isDigit()
       (decimal digits). All of these are wrappers around category() which return the Unicode-defined category of each character.