https://bugs.documentfoundation.org/show_bug.cgi?id=168639

            Bug ID: 168639
           Summary: text import dialog -- ENHANCEMENT add regExp separator
                    such as numeric and dates
           Product: LibreOffice
           Version: 25.8.1.1 release
          Hardware: All
                OS: Linux (All)
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Calc
          Assignee: [email protected]
          Reporter: [email protected]

Created attachment 203076
  --> https://bugs.documentfoundation.org/attachment.cgi?id=203076&action=edit
Text Import dialog LO Calc future release

LO Calc 25.8.1
concern: text import dialog

Hi,
this a general feature enhancements, which would slash down other bugs reports.

Challenge: convert text to csv

It's common to copy tables from text source (html, pdf, docx, etc) and the text
import dialog lacks solutions.

1/ TEXT TO COPY
E.g.copy & paste (CTRL+SHIFT+V) to open text import dialog the following:
NVIDIA 4,249.99 5.44 Info Tech
MICROSOFT CORP 3,577.70 4.58 Info Tech
APPLE 3,467.20 4.44 Info Tech
AMAZON.COM 2,188.03 2.80 Cons Discr
META PLATFORMS A 1,603.83 2.05 Comm Srvcs
BROADCOM 1,328.83 1.70 Info Tech
ALPHABET A 1,239.14 1.59 Comm Srvcs
ALPHABET C 1,049.09 1.34 Comm Srvcs
TESLA 967.84 1.24 Cons Discr
JPMORGAN CHASE & CO 837.67 1.07 Financials


2/ With detect you'll get sthg as, which is not what you want, namely:
NVIDIA;4,249.99;5.44;Info;Tech;;
MICROSOFT;CORP;3,577.70;4.58;Info;Tech;
APPLE;3,467.20;4.44;Info;Tech;;
AMAZON.COM;2,188.03;2.80;Cons;Discr;;
META;PLATFORMS;A;1,603.83;2.05;Comm;Srvcs
BROADCOM;1,328.83;1.70;Info;Tech;;
ALPHABET;A;1,239.14;1.59;Comm;Srvcs;
ALPHABET;C;1,049.09;1.34;Comm;Srvcs;
TESLA;967.84;1.24;Cons;Discr;;
JPMORGAN;CHASE;&;CO;837.67;1.07;Financials


3/ Your aim(that you can not presently attain due to fact that numerical value
are not treated as separators):
NVIDIA;4249.99;5.44;Info Tech
MICROSOFT CORP;3577.70;4.58;Info Tech
APPLE;3467.20;4.44;Info Tech
AMAZON.COM;2188.03;2.80;Cons Discr
META PLATFORMS A;1603.83;2.05;Comm Srvcs
BROADCOM;1328.83;1.70;InfoTech
ALPHABET A;1239.14;1.59;Comm Srvcs
ALPHABET C;1049.09;1.34;Comm Srvcs
TESLA;967.84;1.24;Cons Discr
JPMORGAN CHASE & CO;837.67;1.07;Financials


4/ SOLUTION:
Well LO Calc has already what is needed, but needs to brought into the "text
import dialog".
Attached screenshots are:
1/ text import dialog as current 25.8.1,
2/ text import dialog a proposed dirty mock modification for future release.

In this mock dialog, I propose to had as separator new fields being regExp with
a pull down list with choices such numeric, dates, etc
This is similar to what already exists in Calc, when one change the format of
cells(CTRL+1) and choose tab "Numbers" in which you can choose for instance
dates as YYYY-MM-DD (iso 8601) or whatever.


Additionnaly,
I propose to have an indication of the consistancy of the output by having a
first field the number of cols LO Calc detects for each row.
So in the case of
NVIDIA;4,249.99;5.44;Info;Tech;;
MICROSOFT;CORP;3,577.70;4.58;Info;Tech;
APPLE;3,467.20;4.44;Info;Tech;;
AMAZON.COM;2,188.03;2.80;Cons;Discr;;
META;PLATFORMS;A;1,603.83;2.05;Comm;Srvcs
BROADCOM;1,328.83;1.70;Info;Tech;;
ALPHABET;A;1,239.14;1.59;Comm;Srvcs;
ALPHABET;C;1,049.09;1.34;Comm;Srvcs;
TESLA;967.84;1.24;Cons;Discr;;
JPMORGAN;CHASE;&;CO;837.67;1.07;Financials

The output would be:
4;NVIDIA;4249,99;5,44;Info;Tech;;
5;MICROSOFT;CORP;3577,7;4,58;Info;Tech;
4;APPLE;3467,2;4,44;Info;Tech;;
4;AMAZON.COM;2188,03;2,8;Cons;Discr;;
6;META;PLATFORMS;A;1603,83;2,05;Comm;Srvcs
4;BROADCOM;1328,83;1,7;Info;Tech;;
5;ALPHABET;A;1239,14;1,59;Comm;Srvcs;
5;ALPHABET;C;1049,09;1,34;Comm;Srvcs;
4;TESLA;967,84;1,24;Cons;Discr;;
7;JPMORGAN;CHASE;&;CO;837,67;1,07;Financials


Thank you

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to