https://bugs.documentfoundation.org/show_bug.cgi?id=98627

--- Comment #27 from Tex2002ans <[email protected]> ---
Hmmm... the enhancement I could think of is something similar to bug 97179
Comment 6's solution.

Where you could:

- DO NOT suggest "technical" dictionary words on 1-letter typos.

I think that would help mitigate a lot of Shantanu's original issue.

- - -

This way, something like...

CASE 1: attachment 153263 's 1-letter "typo" like:

- य = U+092F = DEVANAGARI LETTER YA

Right-Click currently gets:

- 5 Marathi + 5 useless "technical words" at the bottom.

After:

- 5 "Marathi dictionary" suggestions

CASE 2: A 1-letter "typo" like:

- ȵ = U+0235 = LATIN SMALL LETTER N WITH CURL

Right-Click currently gets this in English:

- 15 1-letter suggestions + 5 "technical words"

After:

- 15 (or 20) 1-letter suggestions

- - -

> The group of these five words appear for every two-letter misspelled word in 
> any language:
>
> AI, HP, LG, UI, UX

If you go to:

- Tools > Options
- Languages and Locales > Writing Aids

there are the "User-Defined Dictionaries".

These exact words are a part of the dictionary:

- technical [All]

If you thought they were completely clogging up your Right-Click suggestions
for your specific language—Marathi—then all you have to do is just:

- UNCHECK the box for that dictionary.

- - -

And what is the "technical" dictionary?

It just has a big list of:

- (International) company names
  - Acer
  - AMD
  - Asus
  - ChatGPT
  - HP
  - IBM
  - Facebook
  - Microsoft
  - LibreOffice
- Technical extensions/terms/formats
  - HDMI
  - HEVC
  - HTTP
  - FLAC
  - PNG
  - GPT
  - LAN
  - iOS
- Greek/mathematical symbols
  - α
  - β
  - γ

This ensures that *any* language talking about "DOCX" or "PNG" files, or an
enormous company like "Microsoft" or "ChatGPT", won't get red squigglies.

And because many of these "technical words" are 2, 3, or 4 characters long,
they often appear in Right-Click on those 1, 2, or 3 character long "misspelled
words"!

- - -

> If none of the (visible) suggestions is helpful, then the human brain needs 
> to decide how to deal with the supposed spelling violation.
>
> I see no bug here...

Yep, same. "Annoyance", yes. Bug, no.

Uncheck the "technical [All]" dictionary in this specific case, and all those
complaints disappear.

But I think the enhancement I came up with at the top would be a big step in
the right direction. And it's non-intrusive. And I don't think anybody would
mind it. :)

Sure, those 2- and 3-character cases would still exist (and those are trickier,
see below).

But that suggestion I made would completely eliminate the busted 1-character
typo case! :)

- - -

> A spell checker is not a word prediction system.

Yep. If you want more technical details, I summarized quite a bit back in:

- /r/LibreOffice: "Is the auto-correction tool of many languages changing
correct words into others?"
-
https://old.reddit.com/r/libreoffice/comments/135kn9j/is_the_autocorrection_tool_of_many_languages/jkc7dz2/
   - Especially the "AutoCorrect Categories" and "Recommended Resources" and
"More Spellchecking Resources".

Especially see the info about "Edit Distance":

- https://en.wikipedia.org/wiki/Edit_distance

The Right-Click suggestions get "ranked", and mostly go by simple rules like:

- "Is there just a missing accent?"
   - -> Add an accent
- "Is there just a capital letter missing?"
   - -> Capitalize the word.
- "Did you accidentally squish 2 correct words together?"
   - -> Add a SPACE between.
- "Are 2 letters accidentally flipped?"
   - = Transposition
- "How many characters are different?"
   - 1 character difference goes much higher up the suggestion list.
   - 2 character difference goes much lower on the suggestion list.

So words that are much "closer" to the typo, get pushed up to very top of the
list, like:

- cafe vs. café
   - Same letters, but only missing 1 accent.
- microsoft vs. Microsoft
   - Same letters, but only 1 capitalization.
- libreoffice vs. LibreOffice
   - Same letters, but only 2 capitalization.

And ones that are much "further"/worse, appear towards the bottom of the list,
like:

- misspellling -> dispelling
   - Accidentally 3 'l's.
   - Plus an "ss" -> "s".
   - Plus an 'm' -> 'd'.
   - = 3 full letters away!!!

The user likely wanted "misspelling", which is the 1st/"very best" Right-Click
choice... but the 4th ranked "dispelling" could still be possible.

Like Comment #25 says, it's then up to the user to decide to take that advice
(or ignore it).

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to