
[Solved] LibreOffice File format error found at SAXParse

Posted: Sat Dec 24, 2016 7:19 am
by chrisrw
Hello,
Please help me. I have the same issue with my files, and I would really appreciate it if you could fix it for me.
Here is the link you can access it from: https://wetransfer.com/downloads/059862 ... 712/9fafa3
or https://drive.google.com/drive/my-drive (the file is "assignment ICT4D").
Thank you,
Chris

Re: [Solved] LibreOffice File format error found at SAXParse

Posted: Sat Dec 24, 2016 7:59 am
by chrisrw
Hello,
Please help me. I have the same issue with my file, and I have to submit it no later than today.
The file is too big to attach, so you can access it at this link: https://we.tl/khVn4jDjRg

I would appreciate your help.
Thank you,
Chris

Re: [Solved] LibreOffice File format error found at SAXParse

Posted: Sat Dec 24, 2016 10:47 am
by RoryOF
chrisrw wrote:
Hello,
Please help me. I have the same issue with my files, and I would really appreciate it if you could fix it for me.
Here is the link you can access it from: https://wetransfer.com/downloads/059862 ... 712/9fafa3
or https://drive.google.com/drive/my-drive (the file is "assignment ICT4D").
Thank you,
Chris


Try opening this using AbiWord.

Re: [Solved] LibreOffice File format error found at SAXParse

Posted: Sat Dec 24, 2016 1:20 pm
by John_Ha
Try this file https://dl.dropboxusercontent.com/u/100 ... UTLINE.odt.

I merely opened it with Apache OpenOffice and saved it. Please let us know if any content was lost so we can recommend this method to others if it works OK.

Re: [Solved] LibreOffice File format error found at SAXParse

Posted: Sat Dec 24, 2016 1:29 pm
by RoryOF
I got nine pages in the file with OpenOffice and thirteen in AbiWord (from extra content). No error was reported when I examined the .xml file.

Re: [Solved] LibreOffice File format error found at SAXParse

Posted: Sat Dec 24, 2016 2:42 pm
by John_Ha
chrisrw

Your method of formatting leaves a lot to be desired. Go to View > Nonprinting Characters (or click the ¶ icon) to see what you are doing.

Ideally you should be using Styles, but if you want to apply direct formatting, you should be using things like Format > Paragraph, or Align left/centre/right, and/or using the built-in tabs or setting your own left, centre and right tabs. Some text is justified (stretched to reach both margins) and some is not. You are using multiple different fonts. Some paragraphs start outside the page margins. Different paragraphs have Format > Paragraph > Indents and spacing > Gap after paragraph etc. set to different values.

See the uploaded file here where I have removed all formatting and set all paragraphs to 2mm gap below. Now apply your formatting - I doubt it will take you more than 10 minutes and it will look much more professional. Learn about the Wrap (don't use multiple new paragraphs!!) and Anchor options for the images. Right click an image for the options including adding a caption.

Can I suggest you read the up to date Writer guide in my footer - it is written for students.

Clipboard01.png
Poor formatting

Re: [Solved] LibreOffice File format error found at SAXParse

Posted: Sat Dec 24, 2016 4:30 pm
by John_Ha
chrisrw

See this file - it contains all the text from your file I downloaded but has no formatting.

I got all the text out by:

1 Extract \word\document.xml by un-ZIPping the .docx file
2 Open document.xml with Notepad++
3 Go to Search > Replace ..., enter <[^>]+> as the search argument and leave the replace argument blank. Be sure to tick Regular expression. Click Replace All.

This removes all the XML tags and leaves just the text.
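
If you would rather script the three steps than do them by hand, here is a minimal Python sketch of the same idea. It is only an illustration, not what I actually ran: the function names strip_tags and docx_to_plain_text are made up for this example, and it assumes the text is stored in word/document.xml inside the .docx, as in step 1.

```python
import re
import zipfile

# Same regular expression as the Notepad++ search argument in step 3.
TAG_PATTERN = re.compile(r"<[^>]+>")


def strip_tags(xml_text):
    """Delete everything that looks like an XML tag, leaving only the text."""
    return TAG_PATTERN.sub("", xml_text)


def docx_to_plain_text(docx_path):
    """Read word/document.xml from a .docx (which is a ZIP archive) and strip its tags."""
    with zipfile.ZipFile(docx_path) as archive:
        xml_text = archive.read("word/document.xml").decode("utf-8")
    return strip_tags(xml_text)
```

As with the Notepad++ method, all the text comes out as one run with no paragraph breaks, and XML entities such as &amp; are left as they are.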

Clipboard01.png
Notepad++ Search using Replace to remove all XML tags

You cannot do this in Writer because the XML contains more than 64k (actually 65,536) characters all in one single paragraph. You can paste the entire XML into Writer but as soon as you then try to do anything, Writer crashes because you have exceeded the Writer 64k paragraph limit. If you want to use Writer you need to split document.xml into parts where each part has less than 64k characters.
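
One way to do the splitting is sketched below in Python. It is an assumption-laden illustration, not a tested recipe for Writer: the function name split_into_parts and the default of 60,000 characters per part are my own choices, picked only to stay safely under the 64k limit mentioned above. It prefers to cut just after a ">" so that no XML tag is broken in half.

```python
def split_into_parts(text, max_chars=60000):
    """Split text into consecutive parts of at most max_chars characters.

    Where possible, each cut is made just after a '>' so a tag is not split.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    parts = []
    start = 0
    while start < len(text):
        end = min(start + max_chars, len(text))
        if end < len(text):
            # Look for the last '>' inside this window and cut right after it.
            cut = text.rfind(">", start, end)
            if cut != -1:
                end = cut + 1
        parts.append(text[start:end])
        start = end
    return parts
```

Each part would then go into Writer as its own paragraph, for example by writing the parts to a text file with a newline between them.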