February 2007

Blossoms and the Berkeley winter (spring)?

Some nice blossoms in Berkeley....

(I'm demonstrating the blogging of multiple Flickr photos with Flock with this post)

Flickr

Comments (0)

Permalink

Notelets for 2007.02.19

I may have to get in the business of parsing Excel spreadsheets, making use of information I find at OpenOffice.org's Documentation of the Microsoft Excel File Format Excel Versions 2, 3, 4, 5, 95, 97, 2000, XP, 2003 and sc: Spreadsheet Project -- Microsoft Excel - Wikipedia, the free encyclopedia

QEDWiki is ready to try. Will it make mashups easier, even trivial, to create?

I just posted a query on the Flickr discussion group: WSDL from Flickr: unorthodox SOAP invocation?:

    I've been interested in generating WSDL from the Flickr reflection methods to generate a library that could better keep up with the changes in the Flickr APIs. However, I've run into a problem that stems from what I believe to be either unorthodox SOAP syntax in Flickr -- or just the limitations in my own understanding of WSDL and SOAP.

Let's see what I hear back.

Notelets

Comments (0)

Permalink

Extracting text from a Word document

Although I'm writing my mashup book in Microsoft Word, I'd like to publish it in a variety of forms, including HTML, various varieties of XML, PDF, wiki-markup. There are various ways to extract content out of my Word documents, including Word macros, external scripts using the COM interface, or saving the Word 2003 documents as Word XML. I'm partial to using Python to do some simple extraction of text as a first step:

   
import win32com.client  
wd = win32com.client.Dispatch("Word.Application")  
doc = wd.Documents.Open(r'D:\\Document\\PersonalInfoRemixBook\\858Xch05__.doc')  
print doc.Content.Text

I've not been able to find complete reference documentation for the Word 2003 object model. Word 2003 Object Model was a blank page for me under "Objects". Best, probably, to look at documentation for Office XP.

Uncategorized

Comments (0)

Permalink