Bug 150621

Summary: Word count incorrect for Korean text
Product: LibreOffice Reporter: chansol98s
Component: WriterAssignee: Not Assigned <libreoffice-bugs>
Status: NEW ---    
Severity: normal    
Priority: medium    
Version: 7.3.4.2 release   
Hardware: All   
OS: All   
Whiteboard:
Crash report or crash signature: Regression By:
Bug Depends on:    
Bug Blocks: 103479, 113196    

Description chansol98s 2022-08-26 11:25:10 UTC
Description:
Libreoffice Writer counts every Korean character as a word, which results in incorrect word counts for Korean text.

Steps to Reproduce:
1. In Writer, input any text in Korean. For example, "이 문장은 여섯 단어로 이루어져 있습니다."
2. Check the word count at Tools > Word Count

Actual Results:
The word count is incorrect, counting every Korean character as a word. For the above example, the word count shows up as 18.

Expected Results:
The word count should only count words, not characters. For the above example, the word count should be 6.


Reproducible: Always


User Profile Reset: No



Additional Info:
In Korean writing, a word is defined the same way as English, so a string of characters separated by a space counts as a single word. 

Some more examples:
아주 짧은 예시  //Actual 3 words, Writer counts 6.
이 문장은 여섯 단어로 이루어져 있습니다.  //Actual 6 words, Writer counts 18.
예시로, 이런 문장도 가능하겠지요?  //Actual 4 words, Writer counts 16.


Version: 7.3.4.2
Build ID: 30(Build:2)
CPU threads: 12; OS: Linux 5.18; UI render: default; VCL: kf5 (cairo+xcb)
Locale: en-US (en_US.UTF-8); UI: en-US
Calc: threaded
Comment 1 Michael Warner 2022-08-26 14:25:29 UTC
Repro in:
Version: 7.4.0.2 / LibreOffice Community
Build ID: 1512ce97d7ed39dce3121f7e15651fd8895f950e
CPU threads: 10; OS: Mac OS X 12.5; UI render: default; VCL: osx
Locale: en-US (en_US.UTF-8); UI: en-US
Calc: threaded
Comment 2 J_A 2024-05-07 03:05:35 UTC
This bug is still present.

Tested using:
Version: 24.2.2.2 (X86_64) / LibreOffice Community
Build ID: d56cc158d8a96260b836f100ef4b4ef25d6f1a01
CPU threads: 16; OS: Windows 10.0 Build 19045; UI render: Skia/Vulkan; VCL: win
Locale: en-US (en_US); UI: en-US
Calc: CL threaded

and

Version: 24.8.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: bc7501d30ab9a3a5edec66d2f0022177f85d6127
CPU threads: 16; OS: Windows 10 (10.0 build 19045); UI render: Skia/Vulkan; VCL: win
Locale: en-US (en_US); UI: en-US
Calc: CL threaded