Skip to main content

Editing Lengthy Texts and Notes.

I am currently working on an assignment where the students in my class are expected to go through a book and make a list of all the errors we can find in the book. Well, to be fair, it's actually 10 chapters of a book and ~20 pages on an average per chapter. Well, the lazy engineering students that we are, we formed a group and divided work/chapters among ourselves so that none of us will have to read the whole thing but all of us get full credit. BTW, this is for a humanities course titled 'Environmental and Resource Economics' and the textbook we are expected to go through is called 'Energy and the Wealth of Nations'.

And in this process, i came upon one interesting question.

As i was reading the chapters, i noticed how hard it was for me to keep track of the errata.
As it is a book, the whole thing is written in chapters and pages and if i do find a mistake, i'd have to note down the # of the page, the paragraph in the page and the sentence in that particular paragraph. And because this is not a literary piece of work but a (partially) scientific one, most of the errata i find are problems with sentence formations. And there are a lot of them.

This got me wondering as to how actual textbooks are edited. Or even notes pertaining to a certain course which a professor might post online. As far as i know, books are written one paragraph at a time, not one sentence after another. It does not make sense for a writer to break his line of thought by writing his piece in individual sentences. Even the reader will have trouble understanding a book with individual sentences.

What i'm trying to say is that, instead of writing the paragraph above, i could've done this -

  1. This got me wondering how actual textbooks are edited. 
  2. Or even notes pertaining to a certain course which a professor might post online. 
  3. As far as i know, books are written one paragraph at a time and not one sentence after another. 
  4. It does not make sense for a writer to break his line of thought by writing his piece in individual sentences. 
  5. Even the reader will have trouble understanding a book with individual sentences. 

Now, reading this will not have the same effect as reading the paragraph. But we want to make editing a book easy, not reading one. And if the whole book were written in sentences, then someone expected to go through the book will easily be able to point out the mistakes. You can just note down the sentence number and you're done, instead of telling him the page #, paragraph and sentence #  i.e you could've said that there was a mistake in sentence no.4 than saying that there was a mistake in the new sentence starting in the 3rd line of the paragraph. 

Granted, writing a book this way will be hard. But i'm sure a small piece of code will help in breaking a given paragraph into sentences so as to help with the editing process. 
We'd basically be looking for a fullstop and everytime we find one, we insert a line break before the fullstop, while taking ellipsis - '...' into consideration.  
Numbering the sentence should'nt be a hard task as we'd just be inserting a number after the line break. The number is the value of a counter and the value of the counter increases by 1 everytime we have a linebreak. This way, we can easily convert a given paragraph into numbered sentences, making the editing process easy. 

I wonder if a small shell script can do this. 
Let's see...

An Update - So, friend of mine shared a couple of links that are kinda related to this article and the topic... 
Proof Reading Guidelines, a collection of rules and guidelines one should use to proofread an article or a book. 
One Sentence per Line, Why I Two Space are a couple of interesting articles on typography. 

Popular posts from this blog

Animation using GNUPlot

Animation using GNUPlotI've been trying to create an animation depicting a quasar spectrum moving across the 5 SDSS pass bands with respect to redshift. It is important to visualise what emission lines are moving in and out of bands to be able to understand the color-redshift plots and the changes in it.
I've tried doing this using the animate function in matplotlib, python but i wasn't able to make it work - meaning i worked on it for a couple of days and then i gave up, not having found solutions for my problems on the internet.
And then i came across this site, where the gunn-peterson trough and the lyman alpha forest have been depicted - in a beautiful manner. And this got me interested in using js and d3 to do the animations and make it dynamic - using sliders etc.
In the meanwhile, i thought i'd look up and see if there was a way to create animations in gnuplot and whoopdedoo, what do i find but nirvana!

In the image, you see 5 static curves and one dynam…

on MOOCs.

For those of you who don't know, MOOC stands for Massively Open Online Course.

The internet is an awesome thing. It's making education free for all. Well, mostly free. But it's surprising at the width and depth of courses being offered online. And it looks like they are also having an impact on students, especially those from universities that are not top ranked. Students in all parts of the world can now get a first class education experience, thanks to courses offered by Stanford, MIT, Caltech, etc.

I'm talking about MOOCs because one of my new year resolutions is to take online courses, atleast 2 per semester (6 months). And I've chosen the following two courses on edX - Analyzing Big Data with Microsoft R Server and Data Science Essentials for now. I looked at courses on Coursera but I couldn't find any which was worthy and free. There are a lot more MOOC providers out there but let's start here. And I feel like the two courses are relevant to where I …

Pandas download statistics, PyPI and Google BigQuery - Daily downloads and downloads by latest version

Inspired by this blog post :, I wanted to play around with Google BigQuery myself. And the blog post is pretty awesome because it has sample queries. I mix and matched the examples mentioned on the blog post, intent on answering two questions - 
1. How many people download the Pandas library on a daily basis? Actually, if you think about it, it's more of a question of how many times was the pandas library downloaded in a single day, because the same person could've downloaded multiple times. Or a bot could've.
This was just a fun first query/question.
2. What is the adoption rate of different versions of the Pandas library? You might have come across similar graphs which show the adoption rate of various versions of Windows.
Answering this question is actually important because the developers should have an idea of what the most popular versions are, see whether or not users are adopting new features/changes they provide…