HTML encoded spreadsheets?
- From: David Stanaway <david stanaway net>
- To: gnumeric-list gnome org
- Subject: HTML encoded spreadsheets?
- Date: Wed, 07 Sep 2005 17:53:21 -0500
I am having a problem with a few vendors sending me data that they thing
is an Excel spreadsheet, but it is infact an HTML file with an XLS
extention.
I suspect that it is actually created by Excel.
Here is the head of one sample:
<html xmlns:v="urn:schemas-microsoft-com:vml"
xmlns:o="urn:schemas-microsoft-com:office:office"
xmlns:x="urn:schemas-microsoft-com:office:excel"
xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=us-ascii">
<meta name=ProgId content=Excel.Sheet>
<meta name=Generator content="Microsoft Excel 11">
<link rel=File-List href="foo_files/filelist.xml">
<link rel=Edit-Time-Data href="foo_files/editdata.mso">
<link rel=OLE-Object-Data href="foo_files/oledata.mso">
<!--[if !mso]>
<style>
v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
x\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style>
...
I am not sure how these files originate, but I suspect, it is through a
web app that outputs an html table with mime type for excel, then the
vendor uses "Save As.." in Excel and perhaps it defaults to "Save as
Webpage".
Regardless, Gnumeric fails to load these properly, and if I rename the
file to .html and open in gnumeric, I loose cell color formating.
I know this is a problem with the html format module not having a probe
method, and the developers want to avoid probe method proliferation as
it will adversely effect performance, but if these kind of files crop up
from normal use of Excel, they probably should be supported.
I have opened this bug previously when I was having problems with data
from another vendor.
http://bugzilla.gnome.org/show_bug.cgi?id=311879
but this file that is causing problems now seems to have been created by
Excel. This file also gives oocalc some grief too (Unlike the previous
one I had problems with).
I also do not get the foo_files/* files, just the foo.xls
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]