Send cms-list mailing list submissions to
[EMAIL PROTECTED]

To subscribe or unsubscribe via the World Wide Web, visit
http://cms-list.org/mailman/listinfo/cms-list
or, via email, send a message with subject or body 'help' to
[EMAIL PROTECTED]

You can reach the person managing the list at
[EMAIL PROTECTED]

When replying, please edit your Subject line so it is more specific
than "Re: Contents of cms-list digest..."


Today's Topics:

1. Database Publishing vs. Content Management (Bob Puccinelli)
2. RE: Database Publishing vs. Content Management (Max Dunn)
3. Data Extraction (Hari M)
4. Re: Data Extraction (Andy McKay)
5. Re: Data Extraction (Russell Nakano)
6. RE: Data Extraction - use the Practical Extraction and
Reporting Language (Peter Kappus)
7. Re: Data Extraction - use the Practical Extraction and Reporting Language (Lux)
8. RE: Data Extraction - use the Practical Extraction
and Reporting Language (Charles Reitzel)

--__--__--

Message: 1
Reply-To: <[EMAIL PROTECTED]>
From: "Bob Puccinelli" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Date: Mon, 21 Oct 2002 15:19:24 -0400
Subject: [cms-list] Database Publishing vs. Content Management

I am look for help in determining the correct solution for creating a
document to publish (Print, XML, HTML, PDF) which will be made up of
standard and variable elements assembled based on pre-defined business
rules. When would a database publishing tool be used such as InSystems vs.
a content/document management tool such as Documentum. The document might
have any number of combinations of "elements" both graphically and text
based. There is also a requirement for collaboration to review and approve
the text based elements both internal and external to the organization and
there is a requirement for integration with external data sources.

Another question would be "what is the difference between database
publishing and content management?"

Any help would be appreciated.



--__--__--

Message: 2
From: "Max Dunn" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Subject: RE: [cms-list] Database Publishing vs. Content Management
Date: Mon, 21 Oct 2002 13:01:58 -0700

 Another question would be "what is the difference
 between database publishing and content management?"
I've worked a long long time in database publishing, and over the past 5
years in some forms of content management.

I would say that the main traditional difference is the same as the
difference between "publishing" vs. "management," though there is some
overlap.

For example, as a company providing database publishing services we are
a black box: we expect data/documents in, and we return PDF or
PostScript for print output, and update the data/content behind an
interactive Web site for Web delivery. We could care less about the
internal processes of the company we are working with (i.e. which
authoring tool/database they used, the type of approval workflow), as
long as the data and documents conform to expected formats/standards.

Content management, on the other hand, includes other dimensions as
well: content creation and approval, i.e. authoring and management as
well as publishing.

Sounds like you definitely need to delve into the content management
side given your requirements.

Max



--__--__--

Message: 3
From: "Hari M" <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Date: Mon, 21 Oct 2002 18:35:57 -0700
Subject: [cms-list] Data Extraction

Can anyone recommend a data extraction software that help transfer content
from a Web-site to a CMS?

Thank you.

HM



_________________________________________________________________
Surf the Web without missing calls! Get MSN Broadband.
http://resourcecenter.msn.com/access/plans/freeactivation.asp


--__--__--

Message: 4
From: "Andy McKay" <[EMAIL PROTECTED]>
To: "Hari M" <[EMAIL PROTECTED]>, <[EMAIL PROTECTED]>
Subject: Re: [cms-list] Data Extraction
Date: Mon, 21 Oct 2002 21:24:24 -0700

Thats a very open ended and impossible to answer question without knowing at
least the CMS and the website data you want to extract. Most CMS's have some
functionality for doing this, consult your CMS documentation...
--
  Andy McKay
  www.agmweb.ca

----- Original Message -----
From: "Hari M" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Monday, October 21, 2002 6:35 PM
Subject: [cms-list] Data Extraction


 Can anyone recommend a data extraction software that help transfer content
 from a Web-site to a CMS?

 Thank you.

 HM



 _________________________________________________________________
 Surf the Web without missing calls! Get MSN Broadband.
 http://resourcecenter.msn.com/access/plans/freeactivation.asp

 --
 http://cms-list.org/
 trim your replies for good karma.



--__--__--

Message: 5
From: "Russell Nakano" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Subject: Re: [cms-list] Data Extraction
Date: Tue, 22 Oct 2002 06:32:38 -0700

The exact nature of a data extraction problem can vary greatly.  To be fair
to the original question, here's a specific situation:

a. In a large web site, the HTML component is greater than, say 5,000 pages.
b. The content needs to be extracted into XML, typically to be used with a
content delivery engine and a form-based editing environment.
c. The underlying form of the original web site, or "templates," need to be
recovered.
d. The content needs to be extracted quickly, accurately, and economically.

If this profile fits, check out www.nahava.com.

Disclaimer: I am writing the software.  Having said that, feel free to
recommend "must-have" features or to suggest web sites that fit the profile
above.

Have fun,
Russell

----- Original Message -----
From: "Hari M" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Monday, October 21, 2002 6:35 PM
Subject: [cms-list] Data Extraction


 Can anyone recommend a data extraction software that help transfer content
 from a Web-site to a CMS?

 Thank you.

 HM


--__--__--

Message: 6
From: Peter Kappus <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Subject: RE: [cms-list] Data Extraction - use the Practical Extraction and
	 Reporting Language
Date: Tue, 22 Oct 2002 09:50:18 -0700

Perl Perl Perl!

Depending on your love of programming, perl can quickly become your best
friend when doing any "Practical Extraction and Reporting".  It's free,
relatively easy to get started with, and will let you do some incredibly
powerful acrobatics with very little effort. (what other language will let
you perform a RegExp with two slashes?)  Of course, beware, as Perl is
sometimes referred to as the Pathologically Eclectic Rubbish Lister.

I've also heard that Python and Ruby are good for this sort of thing, but
Perl has been around a lot longer and has an extensive library of modules to
keep you from reinventing the wheel.  In fact, I recently wrote a simple
localization tool in perl which extracts text from ASP files, stores them as
XML (for each locale) allows forms based in-line editing, and then publishes
new ASP pages in each language using the updated XML.  Quick, dirty, but
it's been in use at my old company for over a year.

Of course, unlike me, you may not run home from work to write code in your
free time...in which case, you should probably talk to Russell.

Anyone else have experience with Python or Ruby for extracting data and
building XML?

Good luck,
-Peter



----- Original Message -----
From: "Hari M" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Monday, October 21, 2002 6:35 PM
Subject: [cms-list] Data Extraction


 Can anyone recommend a data extraction software that help transfer content
 from a Web-site to a CMS?

 Thank you.

 HM

--
http://cms-list.org/
trim your replies for good karma.

--__--__--

Message: 7
Date: Tue, 22 Oct 2002 12:27:47 -0500
Subject: Re: [cms-list] Data Extraction - use the Practical Extraction and Reporting Language
From: Lux <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]


On Tuesday, October 22, 2002, at 11:50 AM, Peter Kappus wrote:
 Anyone else have experience with Python or Ruby for extracting data and
 building XML?
Ruby has really nice regex support, and since regexes are also objects,
it ties in nicely with the rest of the language.  This leads me to
think that some of the new Perl6 regex changes might have been inspired
by Perl's shiny new cousin...

The new Perl6 regex features give me dirty dreams.  I can't wait! ;)

Cheers,

Lux

 Good luck,
 -Peter



 ----- Original Message -----
 From: "Hari M" <[EMAIL PROTECTED]>
 To: <[EMAIL PROTECTED]>
 Sent: Monday, October 21, 2002 6:35 PM
 Subject: [cms-list] Data Extraction


 Can anyone recommend a data extraction software that help transfer
 content
 from a Web-site to a CMS?

 Thank you.

 HM

 --
 http://cms-list.org/
 trim your replies for good karma.
 --
 http://cms-list.org/
 trim your replies for good karma.


--

John Luxford
Simian Systems
_______________________
phone : 204.946.5955
email : [EMAIL PROTECTED]
web   : www.simian.ca
_______________________
web content management
application development
consulting and training


--__--__--

Message: 8
Date: Tue, 22 Oct 2002 13:47:10 -0400
To: [EMAIL PROTECTED]
From: Charles Reitzel <[EMAIL PROTECTED]>
Subject: RE: [cms-list] Data Extraction - use the Practical Extraction
  and Reporting Language

E.g.  http://search.cpan.org/author/GLENNWOOD/Scraper-2.26/

At 09:50 AM 10/22/2002 -0700, Peter Kappus wrote:
Perl Perl Perl!

Depending on your love of programming, perl can quickly become your best
friend when doing any "Practical Extraction and Reporting".  It's free,
relatively easy to get started with, and will let you do some incredibly
powerful acrobatics with very little effort. (what other language will let
you perform a RegExp with two slashes?)  Of course, beware, as Perl is
sometimes referred to as the Pathologically Eclectic Rubbish Lister.


--__--__--

--
http://cms-list.org/

End of cms-list Digest
--
http://cms-list.org/
trim your replies for good karma.

Reply via email to