Help with a script to scrape html table to csv

m.traeumner · June 16, 2021, 8:50am

Hi @robb,
there was one comment on the post:

Cool idea, but doesn’t work for a lot of HTML files. Mine have the data inside the table on its own line (i.e. <td>\ncell value\n</td> ), and this script ends up stripping out that data because it removes every line that doesn’t have a table tag in it.

Perhaps this is the problem. I think you would have to look at how your HTML page is structured and adjust the sed expressions to match.

I think the problem is not HTML as such, but the way the table is laid out in the file.
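
If adjusting sed gets too fiddly, one alternative is to parse the tags instead of filtering line by line, so it no longer matters whether a cell value sits on its own line. Below is a minimal sketch in Python using only the standard library. It is not the script from the post: the names TableToRows and html_table_to_csv are my own, and it assumes the page contains a single table without nested tables.

```python
import csv
import io
from html.parser import HTMLParser


class TableToRows(HTMLParser):
    """Collect the text of every <td>/<th> cell, row by row (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row being read, None outside <tr>
        self._cell = None  # text fragments of the cell being read, None outside a cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text is kept only while we are inside a cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            # Collapse newlines and runs of whitespace inside the cell.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def html_table_to_csv(html):
    """Return the table cells found in `html` as CSV text."""
    parser = TableToRows()
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()
```

You would call html_table_to_csv with the contents of your HTML file and write the returned text to a .csv file. Because the cell text is collected between the opening and closing tag, a layout like <td>\ncell value\n</td> ends up as a single field.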