If you import a file that big, you should import it directly into the workspace and not into the session, so that the data does not go through the transient space and use lots of memory. So use Workspace.getImportContentHandler or Workspace.importXML, not the Session methods. Read the JSR-170 specification for the benefits.

Florent

chewy_fruit_circle wrote:
> I'm currently trying to import an XML file into a bog standard Jackrabbit
> repository.
> The problem is the file is 72.5 MB, containing around 200,000 elements (yes,
> they are all required). This is currently taking about 90 mins (give or
> take) to get into Derby, and that's with indexing off.
>
> The time wouldn't be such an issue if it didn't use 1.7 GB of RAM.
> I've decorated a ContentHandler so it calls:
>
> root.update(<workspace name>)
> root.save()
>
> where root is the root node from the session.
> This is being called after every 500 start elements. The save just doesn't
> seem to flush the contents that have been parsed to the persistent store.
> This is the same whether I use Derby or Oracle as storage. The only time things
> seem to go off to be persisted is when endDocument is hit.
>
> Have I missed something blindingly obvious here? I really don't mind
> everyone having a bit of a chuckle at me. I just want to get this sorted
> out.
>
> thanks

-- 
Florent Guillaume, Director of R&D, Nuxeo
Open Source Enterprise Content Management (ECM)
http://www.nuxeo.com http://www.nuxeo.org +33 1 40 33 79 87
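The advice in the reply (import through the Workspace rather than the Session) could be sketched roughly as below. This is a minimal illustration, not code from the thread: it assumes the JCR 1.0 API (`javax.jcr`) is on the classpath and that the caller already holds a logged-in `Session`. The class name `WorkspaceImportSketch`, the method names, and the `parentAbsPath` and `xmlPath` parameters are hypothetical. Per JSR-170, workspace-level imports are applied directly to the workspace and need no `save()` call; how much memory this saves in practice depends on the repository implementation.

```java
import java.io.BufferedInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;

import javax.jcr.ImportUUIDBehavior;
import javax.jcr.RepositoryException;
import javax.jcr.Session;
import javax.jcr.Workspace;
import javax.xml.parsers.SAXParserFactory;

import org.xml.sax.ContentHandler;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;

// Hypothetical helper class; names are illustrative only.
public class WorkspaceImportSketch {

    // Variant 1: stream the XML file straight into the workspace.
    // No Session.save() is involved, since the import bypasses the
    // session's transient space.
    public static void importStream(Session session, String parentAbsPath, String xmlPath)
            throws IOException, RepositoryException {
        Workspace workspace = session.getWorkspace();
        try (InputStream in = new BufferedInputStream(new FileInputStream(xmlPath))) {
            workspace.importXML(parentAbsPath, in,
                    ImportUUIDBehavior.IMPORT_UUID_CREATE_NEW);
        }
    }

    // Variant 2: obtain the workspace's SAX ContentHandler and drive it
    // with a namespace-aware parser. A decorating handler, like the one
    // described in the question, could wrap the handler obtained here.
    public static void importSax(Session session, String parentAbsPath, String xmlPath)
            throws Exception {
        ContentHandler handler = session.getWorkspace().getImportContentHandler(
                parentAbsPath, ImportUUIDBehavior.IMPORT_UUID_CREATE_NEW);

        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        XMLReader reader = factory.newSAXParser().getXMLReader();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new File(xmlPath).toURI().toString()));
    }
}
```

A call such as `WorkspaceImportSketch.importStream(session, "/", "data.xml")` would import under the root node; the path and file name here are placeholders.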
Related article:
http://mail-archives.apache.org/mod_mbox/jackrabbit-users/200709.mbox/%3Cfccihu$bn1$1@sea.gmane.org%3E