Discussion:
html table to csv
(too old to reply)
Brian Jordan
2023-11-01 15:44:21 UTC
Permalink
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
to eventually make a pdf booklet. The bit from fireworkz onwards is tried
and tested here and is fairly straightforward, the table to csv bit less
so. It looks like some fairly heavy duty searching and replacing will be
needed at first unless there is a program somewhere which might help me;
is there such a thing?
I am aware of some online stuff under Windows which might help but would
really like to do the whole job under RISC OS. Any thoughts appreciated.
Thanks
B
--
_____________________________________________________________________

Brian Jordan
***@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________
Paul Sprangers
2023-11-01 15:56:12 UTC
Permalink
Post by Brian Jordan
It looks like some fairly heavy duty searching and replacing will be
needed at first unless there is a program somewhere which might help me;
is there such a thing?
You might give !ConvText a try. It's at
https://riscos.sprie.nl/sprang.riscos/Downloads/ConvText.zip

Paul
--
https://riscos.sprie.nl
Brian Jordan
2023-11-01 16:04:30 UTC
Permalink
In article <***@sprie.nl>,
Paul Sprangers <***@sprie.nl> wrote:
Thanks

[Snip]
Post by Paul Sprangers
You might give !ConvText a try. It's at
https://riscos.sprie.nl/sprang.riscos/Downloads/ConvText.zip
Paul
B
--
_____________________________________________________________________

Brian Jordan
***@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________
Chris Newman
2023-11-01 16:32:16 UTC
Permalink
Post by Brian Jordan
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and
PrintPDF to eventually make a pdf booklet. The bit from fireworkz
onwards is tried and tested here and is fairly straightforward, the
table to csv bit less so. It looks like some fairly heavy duty
searching and replacing will be needed at first unless there is a
program somewhere which might help me; is there such a thing? I am
aware of some online stuff under Windows which might help but would
really like to do the whole job under RISC OS. Any thoughts
appreciated. Thanks B
CSV Edit Bernard Veasey

CSVamp Ray Favre
--
Chris
Brian Jordan
2023-11-01 16:34:47 UTC
Permalink
Thanks

In article <***@npost.uk>,
Chris Newman <***@npost.uk> wrote:

[Snip]
Post by Chris Newman
CSV Edit Bernard Veasey
CSVamp Ray Favre
B
--
_____________________________________________________________________

Brian Jordan
***@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________
Jean-Michel
2023-11-01 19:45:54 UTC
Permalink
Post by Brian Jordan
Thanks
[Snip]
Post by Chris Newman
CSV Edit Bernard Veasey
CSVamp Ray Favre
B
Thanks for pointing out these programs, they are always useful.
Some time ago I worked on !Psifs and I used the SIBO to RISC OS converters
(They are on Thomas Millius' site). Very convenient.

I just took them out and was able to do the conversion you asked for from
a csv file extracted from !Fireworks.


I have sent you an example to test at your address.
--
Jean-Michel
Jean-Michel
2023-11-01 19:52:51 UTC
Permalink
Post by Jean-Michel
Post by Brian Jordan
Thanks
[Snip]
Post by Chris Newman
CSV Edit Bernard Veasey
CSVamp Ray Favre
B
Thanks for pointing out these programs, they are always useful.
Some time ago I worked on !Psifs and I used the SIBO to RISC OS converters
(They are on Thomas Millius' site). Very convenient.
I just took them out and was able to do the conversion you asked for from
a csv file extracted from !Fireworks.
I have sent you an example to test at your address.
Sorry, I just reread your message and the conversion must be done the
other way!!! :-(

HTML to CSV not CSV to HTML....
--
Jean-Michel
Chris Newman
2023-11-01 16:41:59 UTC
Permalink
Post by Brian Jordan
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and
PrintPDF to eventually make a pdf booklet. The bit from fireworkz
onwards is tried and tested here and is fairly straightforward, the
table to csv bit less so. It looks like some fairly heavy duty
searching and replacing will be needed at first unless there is a
program somewhere which might help me; is there such a thing? I am
aware of some online stuff under Windows which might help but would
really like to do the whole job under RISC OS. Any thoughts
appreciated. Thanks B
I have !UnHTML Mike Williams 1997. Very old but loaded in R5.19. Purpose
see below from its Help file. I can send toyou. So old I presume no
copyright problems.

Converts HTML to Plain Text, Impression, or Draw Textarea format.
Extracts bookmarks from links found on HTML pages.

Author: Mike Williams
***@econym.demon.co.uk

Usage: Choose the output format you require from the iconbar menu.
Drag a HTML file to the UnHTML icon on the iconbar
After a few seconds a save box will open
Drag the resulting text to a filer or application

Plain Text Format:
The HTML tags are ripped out, special characters are converted,
but no formatting is applied.
--
Chris
Harriet Bazley
2023-11-01 21:41:53 UTC
Permalink
On 1 Nov 2023 as I do recall,
Post by Brian Jordan
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
to eventually make a pdf booklet. The bit from fireworkz onwards is tried
and tested here and is fairly straightforward, the table to csv bit less
so. It looks like some fairly heavy duty searching and replacing will be
needed at first unless there is a program somewhere which might help me;
is there such a thing?
Current versions of EasiWriter can load HTML - you won't get any
cascading style sheet formatting, but I've tested it on some pages with
tables that I originally hand-crafted before uploading them to the host
site, and I can load them back into EasiWriter and select the 'Table'
region to save as a selection. EW exports tables as TSV, not CSV, but
I think most things that understand the latter also understand the
former; in the case of Fireworkz, it is perfectly possible to import
tab-separated files into a document as tables provided that they are
*filetyped* as CSV (&DFE).

So it depends how your HTML tables were originally created and how clean
the coding of them is, I suspect. If they genuinely are just tables of
data and not messed up with all sorts of layout stuff then you can load
the pages into EasiWriter, save the tables out as tab-separated text
selections, and bulk-filetype those files as CSV in order to import them
into Fireworkz as tables and/or spreadsheet cells, depending on what you
want to do with them there.
--
Harriet Bazley == Loyaulte me lie ==

I mean to live forever - or die trying!
Brian Jordan
2023-11-02 00:14:58 UTC
Permalink
Many thanks,

I have EasiWriter here although whether it's the current version will be
discovered in the morning.
Post by Harriet Bazley
On 1 Nov 2023 as I do recall,
[Snip my original query]
Post by Harriet Bazley
Current versions of EasiWriter can load HTML - you won't get any
cascading style sheet formatting, but I've tested it on some pages with
tables that I originally hand-crafted before uploading them to the host
site, and I can load them back into EasiWriter and select the 'Table'
region to save as a selection. EW exports tables as TSV, not CSV, but
I think most things that understand the latter also understand the
former; in the case of Fireworkz, it is perfectly possible to import
tab-separated files into a document as tables provided that they are
*filetyped* as CSV (&DFE).
That sounds promising
Post by Harriet Bazley
So it depends how your HTML tables were originally created and how clean
the coding of them is, I suspect. If they genuinely are just tables of
data and not messed up with all sorts of layout stuff then you can load
the pages into EasiWriter, save the tables out as tab-separated text
selections, and bulk-filetype those files as CSV in order to import them
into Fireworkz as tables and/or spreadsheet cells, depending on what you
want to do with them there.
I created the majority of these tables in lovingly hand crafted html back
in the mid '90s and before publishing them they were run through HTML
Tidy until they were squeaky clean. I am quite hopeful.
B
--
_____________________________________________________________________

Brian Jordan
***@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________
Brian Jordan
2023-11-02 09:53:00 UTC
Permalink
Even better!
Post by Brian Jordan
Many thanks,
I have EasiWriter here although whether it's the current version will be
discovered in the morning.
Not only do I have a version which works as Harriet describes it does
even better in that exporting as pdf is available in my version thus
cutting out a number of middle men.

[Snip]
B
--
_____________________________________________________________________

Brian Jordan
***@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________
Harriet Bazley
2023-11-02 10:40:51 UTC
Permalink
On 2 Nov 2023 as I do recall,
Post by Brian Jordan
Even better!
Post by Brian Jordan
Many thanks,
I have EasiWriter here although whether it's the current version will be
discovered in the morning.
Not only do I have a version which works as Harriet describes it does
even better in that exporting as pdf is available in my version thus
cutting out a number of middle men.
Oh, I assumed you actually needed the data in Fireworkz for calculation
purposes rather than layout....
--
Harriet Bazley == Loyaulte me lie ==

Eschew Obfuscation.
Brian Jordan
2023-11-02 11:18:49 UTC
Permalink
Post by Harriet Bazley
On 2 Nov 2023 as I do recall,
[Snip]
Post by Harriet Bazley
Oh, I assumed you actually needed the data in Fireworkz for calculation
purposes rather than layout....
Fair assumption but to explain...
The files I am converting are old Championship tables for a motor racing
club. I can't remember how they were produced, presumably in a
spreadsheet, but all calculations were completed before creating the web
tables. I no longer have the original files but am able to grab the
tables from the, soon to be closed, site. The club has asked if I can
grab all of the tables from 1996 to the present and produce an inclusive
pdf document. I have all the recent (post 2010) files here as Fireworkz
files from which I produce pdfs and HTML (Using Paul Vigay's Webworkz)
and the Fireworkz route for the old files suggested itself to me. The
Easiwriter solution makes it all so much easier, thank you.
B
--
_____________________________________________________________________

Brian Jordan
***@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________
Harriet Bazley
2023-11-02 18:59:52 UTC
Permalink
On 2 Nov 2023 as I do recall,
Post by Brian Jordan
The files I am converting are old Championship tables for a motor racing
club. I can't remember how they were produced, presumably in a
spreadsheet, but all calculations were completed before creating the web
tables. I no longer have the original files but am able to grab the
tables from the, soon to be closed, site. The club has asked if I can
grab all of the tables from 1996 to the present and produce an inclusive
pdf document. I have all the recent (post 2010) files here as Fireworkz
files from which I produce pdfs and HTML (Using Paul Vigay's Webworkz)
and the Fireworkz route for the old files suggested itself to me. The
Easiwriter solution makes it all so much easier, thank you.
Excellent news!
--
Harriet Bazley == Loyaulte me lie ==

Those of you who think you know everything are annoying those of us who do.
Richard Torrens (News)
2023-11-03 10:42:28 UTC
Permalink
Post by Brian Jordan
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
to eventually make a pdf booklet. The bit from fireworkz onwards is tried
and tested here and is fairly straightforward, the table to csv bit less
so. It looks like some fairly heavy duty searching and replacing will be
needed at first unless there is a program somewhere which might help me;
is there such a thing?
I am aware of some online stuff under Windows which might help but would
really like to do the whole job under RISC OS. Any thoughts appreciated.
Thanks
B
If you have Iris - it can export as Text. It uses TAB chars between cells.
--
------------------------------------------------------------------
Richard Torrens. News email address is valid - for a limited time only.
You must use the full News+***@Torrens.org as in the From address.
http://www.Torrens.org for genealogy, natural history, wild food, walks, cats
and more!
Brian Jordan
2023-11-03 11:55:14 UTC
Permalink
[Snip my original request]
Post by Richard Torrens (News)
If you have Iris - it can export as Text. It uses TAB chars between cells.
I do and this knowledge has added a further string to my bow, many thanks.
In the last few days through the help of folks in these parts I have gone
to an "I wonder if..." to a cup overflowing situation. Thanks to all for
your help.
B
--
_____________________________________________________________________

Brian Jordan
***@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________
Harriet Bazley
2023-11-04 12:57:51 UTC
Permalink
On 3 Nov 2023 as I do recall,
Post by Richard Torrens (News)
Post by Brian Jordan
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
to eventually make a pdf booklet. The bit from fireworkz onwards is tried
and tested here and is fairly straightforward, the table to csv bit less
so. It looks like some fairly heavy duty searching and replacing will be
needed at first unless there is a program somewhere which might help me;
is there such a thing?
I am aware of some online stuff under Windows which might help but would
really like to do the whole job under RISC OS. Any thoughts appreciated.
Thanks
B
If you have Iris - it can export as Text. It uses TAB chars between cells.
Even Netsurf does that...
--
Harriet Bazley == Loyaulte me lie ==

"An American is a man with two arms and four wheels".
Richard Torrens (News)
2023-11-05 14:50:15 UTC
Permalink
Post by Harriet Bazley
Post by Richard Torrens (News)
If you have Iris - it can export as Text. It uses TAB chars between cells.
Even Netsurf does that...
But it uses spaces - not TABs!
--
------------------------------------------------------------------
Richard Torrens. News email address is valid - for a limited time only.
You must use the full News+***@Torrens.org as in the From address.
http://www.Torrens.org for genealogy, natural history, wild food, walks, cats
and more!
Harriet Bazley
2023-11-05 16:12:46 UTC
Permalink
On 5 Nov 2023 as I do recall,
Post by Richard Torrens (News)
Post by Harriet Bazley
Post by Richard Torrens (News)
If you have Iris - it can export as Text. It uses TAB chars between cells.
Even Netsurf does that...
But it uses spaces - not TABs!
I'm definitely getting tabs, both from select-and-drag and from
Export->Text. Maybe it depends on the way the table was defined/laid
out in the first place? I've only been testing it on my own tables....
--
Harriet Bazley == Loyaulte me lie ==

Those who can't write, write manuals.
Vince M Hudd
2024-01-03 16:52:24 UTC
Permalink
Post by Brian Jordan
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
to eventually make a pdf booklet. The bit from fireworkz onwards is tried
and tested here and is fairly straightforward, the table to csv bit less
so. It looks like some fairly heavy duty searching and replacing will be
needed at first unless there is a program somewhere which might help me;
is there such a thing?
I see I'm late to the party on this one (this is my 'annual' usenet
catchup!) so you already have solutions suggested, but I may as well add
that WebChange (with the aid of a suitable script) can do this.

(Although I no longer have the script to hand that I was using at the
time, it's one of the things I used to do as a demo of the software).

The only flaw was that it would only be able to handle the first table
it encountered.
--
Vince M Hudd
Vince M Hudd
2024-01-06 18:21:19 UTC
Permalink
Post by Vince M Hudd
I see I'm late to the party on this one (this is my 'annual' usenet
catchup!) so you already have solutions suggested, but I may as well add
that WebChange (with the aid of a suitable script) can do this.
(Although I no longer have the script to hand that I was using at the
time, it's one of the things I used to do as a demo of the software).
The only flaw was that it would only be able to handle the first table
it encountered.
I was reminded today that I'd let the webchange.co.uk domain go, and
hadn't updated the softrock.co.uk site accordingly - so WebChange has been
missing in action for a while. (h/t to Bernard Boase for pointing it out)

As a quick fix, I've created a new subdomain - webchange.softrock.co.uk
and mapped it to the server space that webchange.co.uk was previously
using, and I've updated the link on the WebChange page on softrock.co.uk
to point to it.

So as of now, WebChange can once again be downloaded.

https://www.softrock.co.uk/products/webchange.html
http://webchange.softrock.co.uk/

The main site itself hasn't actually been properly updated, though (i.e. I
haven't run WebChange on it) - so other links remain broken. That'll give
lots of "we found problems on your website" spammers even more reasons to
email me. ;)
--
Vince M Hudd
Loading...