Hi Rob!

There is a conversion script by Borislav Manolov  and Magnus Manske: 
https://toolserver.org/~magnus/html2wiki.php It does a good job when the markup 
is simple. There has to be a download version somewhere, too.

The other way would be to write a little maintenance script which processes the 
HTML using php's DOMDocument or SimpleXML and produces WikiText. That's a 
little more work to do, but when you have a uniform source html it's not too 
hard and delivers pretty good results.

--
Robert Vogel
Social Web Technologien
Softwareentwicklung
Hallo Welt! - Medienwerkstatt GmbH



Residenzstraße 2
93047 Regensburg

Tel.  +49 (0) 941 – 66 0 80-198
Fax   +49 (0) 941 - 66 0 80-189

www.hallowelt.biz
[email protected]


Sitz: Regensburg
Amtsgericht: Regensburg
Handelsregister: HRB 10467
E.USt.Nr.: DE 253050833
Geschäftsführer: Anja Ebersbach, Markus Glaser, Dr. Richard Heigl, Radovan 
Kubani


-----Ursprüngliche Nachricht-----
Von: [email protected] 
[mailto:[email protected]] Im Auftrag von Rob Sporing
Gesendet: Mittwoch, 5. Juni 2013 00:56
An: MediaWiki announcements and site admin list
Betreff: Re: [MediaWiki-l] Move from PBwiki to Mediawiki

Thanks for the Reply.

Exporting from PBworks gives me one HTML file per entry. Most of the entries 
are very straight forward text entries. Only a few have some simple tables and 
images. Nothing very complicated. If the tables even get a little messed up I 
can easily fixed that later because there are so few of them. If you know of a 
automated import tool that can help that would be great. Thanks again.


On Tue, Jun 4, 2013 at 6:35 PM, Markus Glaser <[email protected]> wrote:

> Hi Rob,
>
> my company has done several migrations to MediaWiki before where HTML 
> was the intermediate format. There are HTML to MW converters as well 
> as automated import tools for MediaWiki articles.
>
> I do not know much about PBWiki. From my experience with other 
> systems, though, you usually need to preprocess the data. Is the 
> export one big HTML file or one file per article? Is it easy to parse 
> (e.g. does it have some complex HTML structures and layout)? How about 
> files, e.g. images (they need to be imported separately).
>
> If you could give me some more details I might be able to point you to 
> some scripts you could try.
>
> Best,
> Markus
>
> Markus Glaser
> Social Web Technologien
> Leiter Softwareentwicklung
> Hallo Welt! - Medienwerkstatt GmbH
>
>
>
> Residenzstraße 2
> 93047 Regensburg
>
> Tel.   +49 (0) 941 - 66 0 80 - 192
> Fax.  +49 (0) 941 - 66 0 80 - 189
>
> www.hallowelt.biz
> [email protected]
>
> Sitz: Regensburg
> Handelsregister: HRB 10467
> E.USt.Nr.: DE 253050833
> Geschäftsführer:
> Anja Ebersbach, Markus Glaser,
> Dr. Richard Heigl, Radovan Kubani
>
> -----Ursprüngliche Nachricht-----
> Von: [email protected] [mailto:
> [email protected]] Im Auftrag von Rob Sporing
> Gesendet: Dienstag, 4. Juni 2013 18:23
> An: [email protected]
> Betreff: [MediaWiki-l] Move from PBwiki to Mediawiki
>
> Does anyone have any tips on the best way to transfer a wiki from 
> PBwiki to Mediawiki? I have a large amount of entries that need to be 
> moved. I can export PBwiki to a HTML file. Thanks in advance.
>
> Rob
> _______________________________________________
> MediaWiki-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>
> _______________________________________________
> MediaWiki-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>
>


--
Rob Sporing
Bethpage Public Library
_______________________________________________
MediaWiki-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
_______________________________________________
MediaWiki-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l

Reply via email to