Hi @robb,
there was one comment to the post:
Cool idea, but doesn’t work for a lot of HTML files. Mine have the data inside the table on its own line (i.e.
<td>\ncell value\n</td>
), and this script ends up stripping out that data because it removes every line that doesn’t have a table tag in it.
Perhaps this is the problem. I think you could analyse your html site and have to change the sed-settings.
I think it’s not the html format, it’s the table format.