Jump to content

Please read the Forum Rules before posting.

Photo
- - - - -

Lange's Commentary for e-Sword


  • Please log in to reply
42 replies to this topic

#11 Josh Bond

Josh Bond

    Administrator

  • Administrators
  • PipPipPipPipPip
  • 2,890 posts
  • LocationGallatin, TN
Offline

Posted 04 July 2011 - 10:35 PM

Could you create macros to do some of that repetitive stuff? I am assuming Word has macros. Again, years ago I had to put references (I think it was) into a certain format so a work could be included in Online Bible and I did a macro that saved me days of work. (Sorry for mentioning that uh other Bible program :-)


Yes, you can create macros in Word. I've downloaded a some of David Cox's macros, for things such as converting roman numerals to alphanumeric digits. I also made a few. Other times, I haven't gone to the trouble because the problem was very document specific.

Your welcome to mention any Bible software. :)

#12 James Rice

James Rice

    Moderator / Resource Builder

  • Banned
  • PipPipPipPipPip
  • 151 posts
  • LocationCleveland, Ohio
Offline

Posted 05 July 2011 - 03:30 PM

Count me in.

#13 LEB

LEB

    New to Bible Support

  • Veterans
  • Pip
  • 14 posts
Offline

Posted 05 July 2011 - 11:04 PM

That would be great to have.

#14 Ebed Doulos

Ebed Doulos

    e-Sword Fanatic

  • Members (T)
  • PipPipPipPipPip
  • 294 posts
Offline

Posted 06 July 2011 - 07:39 PM

I, too, would be interested and I thank you for your labor. Just out of curiosity, are you going to separate the volumes into the original 25, the reprint 12 or the 64 volumes that Logos uses. Or are you going with an entirely different schema?
Posted Image

#15 Josh Bond

Josh Bond

    Administrator

  • Administrators
  • PipPipPipPipPip
  • 2,890 posts
  • LocationGallatin, TN
Offline

Posted 07 July 2011 - 11:49 AM

The bulk of the line breaks were easy to remove, but that's also the least of the issues with this text.

This will be a HUGE project and Niobi really has his work cut out for him. The text was scanned from a scanner (OCR). So on almost every line, there's a typo or misspelling. Sentences sometimes end with odd punctuation (OCR typos), or begin with odd punctuation, or numbers mixed in odd places. There's no formatting (except odd formatting), no bold, no italics, no tabs, no indents, etc. Then there's the verse formatting. I converted some of the roman numerals to digits with a macro, but the verse formatting has many typos as well, so some of the roman numerals still exist in the verse references. Nearly every verse reference will have to be manually tweaked. All of which is typical for OCR text...

He will will almost have to go line by line through the text. A spell check would help, but be careful of greek/hebrew words that an English spellcheck won't recognize.


Then, you have the work of separating the verse and chapter and book comments properly, which just by itself, is a big undertaking.

#16 Vaughn

Vaughn

    e-Sword Addict

  • Veterans
  • PipPipPipPip
  • 74 posts
  • LocationKaufman, Texas
Offline

Posted 07 July 2011 - 01:57 PM

Let me add to the group. Anything and everything
Grace and Peace,
Your fellow Swordsman,
Vaughn R. Jacobs

#17 chosen

chosen

    Liking e-Sword

  • Members (T)
  • PipPip
  • 17 posts
Offline

Posted 08 July 2011 - 12:00 AM

The bulk of the line breaks were easy to remove, but that's also the least of the issues with this text.

This will be a HUGE project and Niobi really has his work cut out for him. The text was scanned from a scanner (OCR). So on almost every line, there's a typo or misspelling. Sentences sometimes end with odd punctuation (OCR typos), or begin with odd punctuation, or numbers mixed in odd places. There's no formatting (except odd formatting), no bold, no italics, no tabs, no indents, etc. Then there's the verse formatting. I converted some of the roman numerals to digits with a macro, but the verse formatting has many typos as well, so some of the roman numerals still exist in the verse references. Nearly every verse reference will have to be manually tweaked. All of which is typical for OCR text...

He will will almost have to go line by line through the text. A spell check would help, but be careful of greek/hebrew words that an English spellcheck won't recognize.


Then, you have the work of separating the verse and chapter and book comments properly, which just by itself, is a big undertaking.


Would you give me any Word macro to remove line breaks?

#18 patchworkid

patchworkid

    Resource Builder

  • Members (T)
  • PipPipPipPipPip
  • 1,554 posts
  • LocationOld England
Offline

Posted 08 July 2011 - 03:16 AM

Hi all. As you know this is a very big project for me. I have uploaded the Lang's Cmt (which is in Word Doc) - In Topics section.

If anyone would like to help, it would be great. This might help.


I searched for: ([a-z])(^13)([a-z])
Replaced: \1 \3

Searched: ([a-z],)(^13)([a-z])
Replaced: \1 \3

Searched: "
Replaced: "



Hey Niobi,

Attached is the sample document I sent. Here is what I did:

I searched for: ([a-z])(^13)([a-z])
Replaced: \1 \3

Searched: ([a-z],)(^13)([a-z])
Replaced: \1 \3

Searched: "
Replaced: "

BUT this will be a HUGE project. The text was scanned from a scanner (OCR). So on almost every line, there's a typo or misspelling. Sentences sometimes end with odd punctuation (OCR typos). There's no formatting, no bold, no italics, no tabs, no indents, etc. Then there's the verse formatting. I converted some of the roman numerals to digits, but the verse formatting has many typos as well, so some of the roman numerals still exist in the verse references.

What you have is a raw text dump, with tens of thousands of misspellings and typos. And you will almost have to go line by line through the text. A spell check would help, but be careful of greek/hebrew words that an English spellcheck won't find.

There's still some lines that will need the line break manually removed because the search missed it (we can't remove all line breaks because then we wouldn't have paragraphs). But 99% of the bad line breaks are gone.

In Word, when you use the Search/Replace feature, a Search Replace Box appears. Click on the More button. More options should appear. Click the Use Wildcards box.

Try searching for this: [a-z]^13[a-z]

And for the replacment text, don't put anything.

That will usually deletes most extra returns.You might need a space before and/or after ^13. it all depends on the document.

Thanks again for your help.
Patchworkid
Merismos the Scriptures with Patchworkid's Study Bible Set<p>http://www.biblesupp...tudy-bible-set/, MySword -http://www.biblesupp...tudy-bible-set/

#19 dyan

dyan

    e-Sword Addict

  • Veterans
  • PipPipPipPip
  • 89 posts
Offline

Posted 08 July 2011 - 08:32 AM

I don't know if this is the right place to ask, but where can i find a macro to convert roman numerals to normal numbers?

#20 patchworkid

patchworkid

    Resource Builder

  • Members (T)
  • PipPipPipPipPip
  • 1,554 posts
  • LocationOld England
Offline

Posted 08 July 2011 - 08:44 AM

I don't know if this is the right place to ask, but where can i find a macro to convert roman numerals to normal numbers?


Hi I know that Tool Tip Tool has it. load it in there as RTF and check out David Cox

thanks
Merismos the Scriptures with Patchworkid's Study Bible Set<p>http://www.biblesupp...tudy-bible-set/, MySword -http://www.biblesupp...tudy-bible-set/




0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users




Similar Topics



Latest Blogs