Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
>> Well if you were prepared to type a search for >> computational linguistics software into google, you would >> find several free tools available for linux listed on pages >> such as >> >> https://martinweisser.org/corpora_site/comp_ling_resources.html > > Indeed, that page has 4 hits for Unix an

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
debian-user wrote: > Well if you were prepared to type a search for computational > linguistics software into google, you would find several > free tools available for linux listed on pages such as > > https://martinweisser.org/corpora_site/comp_ling_resources.html Indeed, that page has 4 hits fo

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread debian-user
Emanuel Berg wrote: > Nicholas Geovanis wrote: > > > Those books teach and discuss some of the software that's > > used. I doubt you will find them in debian's repositories. > > Of course you can do plenty of computational linguistics > > with perl or python which you already have. > > > > What i

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
>> A basic search finds this web tool: >> >> https://www.usingenglish.com/resources/text-statistics/ > > I didn't get it to work in Emacs-w3m, be it lack of JavaScript > support or something else. Anyway the page and tool claims to > do this: > > Total Word Count > Total Word Count (Excluding

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Nicholas Geovanis wrote: > Those books teach and discuss some of the software that's > used. I doubt you will find them in debian's repositories. > Of course you can do plenty of computational linguistics > with perl or python which you already have. > > What is a "regular expression" which is at

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Nicholas Geovanis
On Fri, Jun 30, 2023, 10:32 AM Emanuel Berg wrote: > Nicholas Geovanis wrote: > > > If you have python programming skills, you might > > consider NLTK > > Unbelievable if there are no such tools anywhere already, > but I don't have one either so maybe there aren't then? > >

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Joel Roth wrote: > A basic search finds this web tool: > > https://www.usingenglish.com/resources/text-statistics/ I didn't get it to work in Emacs-w3m, be it lack of JavaScript support or something else. Anyway the page and tool claims to do this: Total Word Count Total Word Count (Excludi

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Nicholas Geovanis wrote: > If you have python programming skills, you might > consider NLTK Unbelievable if there are no such tools anywhere already, but I don't have one either so maybe there aren't then? >>> >>> There's a big subject called computational linguistics. >>> T

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Nicholas Geovanis
On Fri, Jun 30, 2023, 8:32 AM Emanuel Berg wrote: > Nicholas Geovanis wrote: > > >>> If you have python programming skills, you might consider > >>> NLTK > >> > >> Unbelievable if there are no such tools anywhere already, > >> but I don't have one either so maybe there aren't then? > >> > > > > T

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Nicholas Geovanis wrote: >>> If you have python programming skills, you might consider >>> NLTK >> >> Unbelievable if there are no such tools anywhere already, >> but I don't have one either so maybe there aren't then? >> > > There's a big subject called computational linguistics. > They have some

Re: FOSS tool to do general stats from text indata

2023-06-28 Thread Nicholas Geovanis
On Sat, Jun 24, 2023, 3:04 PM Emanuel Berg wrote: > Cousin Stanley wrote: > > > If you have python programming skills, you might consider > > NLTK > > Unbelievable if there are no such tools anywhere already, but > I don't have one either so maybe there aren't then? > There's a big subject calle

Re: FOSS tool to do general stats from text indata

2023-06-28 Thread Emanuel Berg
dvalin wrote: > As "stats" is a grab bag larger inside than the Tardis, > I suspect that only on that other ship with the infinite > improbability drive is a stats babelfish interpreter to be > found. For the last 30+ years, I've just thrown together > a few lines of Awk to generate the initially

Re: FOSS tool to do general stats from text indata

2023-06-25 Thread tomas
On Sun, Jun 25, 2023 at 08:28:05AM +0200, Emanuel Berg wrote: > tomas wrote: > > I mean a general tool, but with options to tweak the > report included, of course. > >>> > >>> If you can bear some tweaking, R is it. > >> > >> Sure! Let's run R on this e-mail. Does it work and if so, wha

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
tomas wrote: I mean a general tool, but with options to tweak the report included, of course. >>> >>> If you can bear some tweaking, R is it. >> >> Sure! Let's run R on this e-mail. Does it work and if so, what >> does it say? > > T a generic question -- a generic answer R is a program

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread tomas
On Sat, Jun 24, 2023 at 10:00:05PM +0200, Emanuel Berg wrote: > tomas wrote: > > >> Is there a CLI and FOSS tool that creates stats from text > >> indata - e.g., > >> > >> $ txt2stats path/to/indata/*.txt > >> > >> I mean a general tool, but with options to tweak the report > >> included, of c

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread John Hasler
Emanuel Berg writes: > Sure! Let's run R on this e-mail. Does it work and if so, what > does it say? Run 'apt-cache show r-base'. You will want to look at all the 'r-cran' packages for one that does what you need. -- John Hasler j...@sugarbit.com Elmwood, WI USA

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
tomas wrote: >> Is there a CLI and FOSS tool that creates stats from text >> indata - e.g., >> >> $ txt2stats path/to/indata/*.txt >> >> I mean a general tool, but with options to tweak the report >> included, of course. > > If you can bear some tweaking, R is it. Sure! Let's run R on this e-

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
Cousin Stanley wrote: > If you have python programming skills, you might consider > NLTK Unbelievable if there are no such tools anywhere already, but I don't have one either so maybe there aren't then? -- underground experts united https://dataswamp.org/~incal

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
Joel Roth wrote: > A basic search finds this web tool: > > https://www.usingenglish.com/resources/text-statistics/ Cool, I'll get back to you when I tried it God willing ... > Otherwise, I think you'll have to write your own -- or hire > someone (like me :^) to write one for you. Surely there m

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
paulf wrote: >>> I don't know about all of your wishlist, but gnuplot is >>> the proper tool for taking data from, say, a CSV file, and >>> putting it into graphs of various types. >> >> Well, gnuplot is great obviously but is more a tool to >> visualize data, organized data, here we need a tool

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Cousin Stanley
On 2023-06-23 13:30, Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > >$ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. > > To produce neat stats, maybe even figures, and g

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread tomas
On Fri, Jun 23, 2023 at 10:20:50PM +0200, Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > > $ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. If you can bear some tweaking,

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread Joel Roth
On Fri, Jun 23, 2023 at 10:20:50PM +0200, Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > > $ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. > > To produce neat stats, mayb

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread paulf
On Fri, 23 Jun 2023 23:05:10 +0200 Emanuel Berg wrote: > paulf wrote: > > > I don't know about all of your wishlist, but gnuplot is the > > proper tool for taking data from, say, a CSV file, and > > putting it into graphs of various types. > > Well, gnuplot is great obviously but is more a tool

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread Emanuel Berg
paulf wrote: > I don't know about all of your wishlist, but gnuplot is the > proper tool for taking data from, say, a CSV file, and > putting it into graphs of various types. Well, gnuplot is great obviously but is more a tool to visualize data, organized data, here we need a tool to analyze and

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread paulf
On Fri, 23 Jun 2023 22:20:50 +0200 Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > > $ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. > > To produce neat stats, maybe eve

FOSS tool to do general stats from text indata

2023-06-23 Thread Emanuel Berg
Is there a CLI and FOSS tool that creates stats from text indata - e.g., $ txt2stats path/to/indata/*.txt I mean a general tool, but with options to tweak the report included, of course. To produce neat stats, maybe even figures, and generate fun facts of the kind The longest word that occ