[EMAIL PROTECTED]
To subscribe or unsubscribe via the World Wide Web, visit
http://cms-list.org/mailman/listinfo/cms-list
or, via email, send a message with subject or body 'help' to
[EMAIL PROTECTED]
You can reach the person managing the list at
[EMAIL PROTECTED]
When replying, please edit your Subject line so it is more specific
than "Re: Contents of cms-list digest..."
Today's Topics:
1. Database Publishing vs. Content Management (Bob Puccinelli)
2. RE: Database Publishing vs. Content Management (Max Dunn)
3. Data Extraction (Hari M)
4. Re: Data Extraction (Andy McKay)
5. Re: Data Extraction (Russell Nakano)
6. RE: Data Extraction - use the Practical Extraction and
Reporting Language (Peter Kappus)
7. Re: Data Extraction - use the Practical Extraction and Reporting Language (Lux)
8. RE: Data Extraction - use the Practical Extraction
and Reporting Language (Charles Reitzel)
--__--__--
Message: 1
Reply-To: <[EMAIL PROTECTED]>
From: "Bob Puccinelli" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Date: Mon, 21 Oct 2002 15:19:24 -0400
Subject: [cms-list] Database Publishing vs. Content Management
I am look for help in determining the correct solution for creating a
document to publish (Print, XML, HTML, PDF) which will be made up of
standard and variable elements assembled based on pre-defined business
rules. When would a database publishing tool be used such as InSystems vs.
a content/document management tool such as Documentum. The document might
have any number of combinations of "elements" both graphically and text
based. There is also a requirement for collaboration to review and approve
the text based elements both internal and external to the organization and
there is a requirement for integration with external data sources.
Another question would be "what is the difference between database
publishing and content management?"
Any help would be appreciated.
--__--__--
Message: 2
From: "Max Dunn" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Subject: RE: [cms-list] Database Publishing vs. Content Management
Date: Mon, 21 Oct 2002 13:01:58 -0700
Another question would be "what is the difference between database publishing and content management?"
I've worked a long long time in database publishing, and over the past 5 years in some forms of content management.
I would say that the main traditional difference is the same as the difference between "publishing" vs. "management," though there is some overlap. For example, as a company providing database publishing services we are a black box: we expect data/documents in, and we return PDF or PostScript for print output, and update the data/content behind an interactive Web site for Web delivery. We could care less about the internal processes of the company we are working with (i.e. which authoring tool/database they used, the type of approval workflow), as long as the data and documents conform to expected formats/standards. Content management, on the other hand, includes other dimensions as well: content creation and approval, i.e. authoring and management as well as publishing. Sounds like you definitely need to delve into the content management side given your requirements. Max --__--__-- Message: 3 From: "Hari M" <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Date: Mon, 21 Oct 2002 18:35:57 -0700 Subject: [cms-list] Data Extraction Can anyone recommend a data extraction software that help transfer content from a Web-site to a CMS? Thank you. HM _________________________________________________________________ Surf the Web without missing calls! Get MSN Broadband. http://resourcecenter.msn.com/access/plans/freeactivation.asp --__--__-- Message: 4 From: "Andy McKay" <[EMAIL PROTECTED]> To: "Hari M" <[EMAIL PROTECTED]>, <[EMAIL PROTECTED]> Subject: Re: [cms-list] Data Extraction Date: Mon, 21 Oct 2002 21:24:24 -0700 Thats a very open ended and impossible to answer question without knowing at least the CMS and the website data you want to extract. Most CMS's have some functionality for doing this, consult your CMS documentation... -- Andy McKay www.agmweb.ca ----- Original Message ----- From: "Hari M" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Monday, October 21, 2002 6:35 PM Subject: [cms-list] Data Extraction
Can anyone recommend a data extraction software that help transfer content from a Web-site to a CMS? Thank you. HM _________________________________________________________________ Surf the Web without missing calls! Get MSN Broadband. http://resourcecenter.msn.com/access/plans/freeactivation.asp -- http://cms-list.org/ trim your replies for good karma.
--__--__-- Message: 5 From: "Russell Nakano" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Subject: Re: [cms-list] Data Extraction Date: Tue, 22 Oct 2002 06:32:38 -0700 The exact nature of a data extraction problem can vary greatly. To be fair to the original question, here's a specific situation: a. In a large web site, the HTML component is greater than, say 5,000 pages. b. The content needs to be extracted into XML, typically to be used with a content delivery engine and a form-based editing environment. c. The underlying form of the original web site, or "templates," need to be recovered. d. The content needs to be extracted quickly, accurately, and economically. If this profile fits, check out www.nahava.com. Disclaimer: I am writing the software. Having said that, feel free to recommend "must-have" features or to suggest web sites that fit the profile above. Have fun, Russell ----- Original Message ----- From: "Hari M" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Monday, October 21, 2002 6:35 PM Subject: [cms-list] Data Extraction
Can anyone recommend a data extraction software that help transfer content from a Web-site to a CMS? Thank you. HM
--__--__-- Message: 6 From: Peter Kappus <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Subject: RE: [cms-list] Data Extraction - use the Practical Extraction and Reporting Language Date: Tue, 22 Oct 2002 09:50:18 -0700 Perl Perl Perl! Depending on your love of programming, perl can quickly become your best friend when doing any "Practical Extraction and Reporting". It's free, relatively easy to get started with, and will let you do some incredibly powerful acrobatics with very little effort. (what other language will let you perform a RegExp with two slashes?) Of course, beware, as Perl is sometimes referred to as the Pathologically Eclectic Rubbish Lister. I've also heard that Python and Ruby are good for this sort of thing, but Perl has been around a lot longer and has an extensive library of modules to keep you from reinventing the wheel. In fact, I recently wrote a simple localization tool in perl which extracts text from ASP files, stores them as XML (for each locale) allows forms based in-line editing, and then publishes new ASP pages in each language using the updated XML. Quick, dirty, but it's been in use at my old company for over a year. Of course, unlike me, you may not run home from work to write code in your free time...in which case, you should probably talk to Russell. Anyone else have experience with Python or Ruby for extracting data and building XML? Good luck, -Peter ----- Original Message ----- From: "Hari M" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Monday, October 21, 2002 6:35 PM Subject: [cms-list] Data Extraction
Can anyone recommend a data extraction software that help transfer content from a Web-site to a CMS? Thank you. HM
--
http://cms-list.org/
trim your replies for good karma.
--__--__--
Message: 7
Date: Tue, 22 Oct 2002 12:27:47 -0500
Subject: Re: [cms-list] Data Extraction - use the Practical Extraction and Reporting Language
From: Lux <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
On Tuesday, October 22, 2002, at 11:50 AM, Peter Kappus wrote:
Anyone else have experience with Python or Ruby for extracting data and building XML?
Ruby has really nice regex support, and since regexes are also objects, it ties in nicely with the rest of the language. This leads me to think that some of the new Perl6 regex changes might have been inspired by Perl's shiny new cousin... The new Perl6 regex features give me dirty dreams. I can't wait! ;) Cheers, Lux
Good luck, -Peter ----- Original Message ----- From: "Hari M" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Monday, October 21, 2002 6:35 PM Subject: [cms-list] Data ExtractionCan anyone recommend a data extraction software that help transfer content from a Web-site to a CMS? Thank you. HM-- http://cms-list.org/ trim your replies for good karma. -- http://cms-list.org/ trim your replies for good karma.
-- John Luxford Simian Systems _______________________ phone : 204.946.5955 email : [EMAIL PROTECTED] web : www.simian.ca _______________________ web content management application development consulting and training --__--__-- Message: 8 Date: Tue, 22 Oct 2002 13:47:10 -0400 To: [EMAIL PROTECTED] From: Charles Reitzel <[EMAIL PROTECTED]> Subject: RE: [cms-list] Data Extraction - use the Practical Extraction and Reporting Language E.g. http://search.cpan.org/author/GLENNWOOD/Scraper-2.26/ At 09:50 AM 10/22/2002 -0700, Peter Kappus wrote:
Perl Perl Perl! Depending on your love of programming, perl can quickly become your best friend when doing any "Practical Extraction and Reporting". It's free, relatively easy to get started with, and will let you do some incredibly powerful acrobatics with very little effort. (what other language will let you perform a RegExp with two slashes?) Of course, beware, as Perl is sometimes referred to as the Pathologically Eclectic Rubbish Lister.
--__--__-- -- http://cms-list.org/ End of cms-list Digest -- http://cms-list.org/ trim your replies for good karma.
