Book review: Pro Perl Parsing by Christopher M. Frenz
Parsing by example
Download the whole article as PDF
Short URL: http://fsmsh.com/1734
- 2006-09-21
- Published on web | Easy
-
Write a full post in response to this!
Pro Perl Parsing is a well-written tome on the subject of various ways of pulling information out of sources such as the HTML, RSS, XML, CSV, the command line and text. More correctly put, the book discusses the extraction of data and some analysis via Perl. The author Christopher M. Frenz understands the value of using CPAN modules and describes parsing through pleasantly brief code examples.
My first impression of the book was one of to-the-core efficiency. Pro Perl Parsing defines that which is required to understand concepts such as context free grammar, regular expression pattern matching and even manages to squeeze in a little data mining.
Accurate and focused, this book does not have an ounce of fat. The author Christopher M. Frenz understands the value of using CPAN modules.
My first impression of the book was one of to-the-core efficiency
The contents
Frenz’s well-balanced book packs much compressed information into its relatively short 252 pages. He starts with a “show us the money” chapter on the important subject of parsing and regex expressions and then quickly moves through details of how grammar parsers actually work. I got the distinct impression of mentally flowing seamlessly to the main points and then having the author deal accurately with the required background concepts.
There was a slight slow down when hitting chapters four and five as the chapters where harder to digest due to the underlying complexities of correctly configuring Parse::Yapp and Parse::RecDescent. The story speeds up again after this and I encountered the fun chapters on pulling information from the internet and data mining. I particularly liked example 6-1 (page 147), which, within a few lines of easy to understand code, pulled and processed the temperature for a particular location from Weather.com.
Sadly for me, I have to admit that, despite thinking I knew much in this subject area, I managed to pick up a trick or two from Pro Perl Parsing. Data mining is a pet love for me and chapter ten delivered a couple of practical hints that I will add to some of my prototype code later. For example, listing 10-1 (page 221) explains briefly the use of Statistics::Descriptive for finding the mean and standard deviation of a particular dataset. If I had known of this module earlier, I would not have previously written a number of custom functions myself (and on more than one occasion).
I managed to pick up a trick or two from Pro Perl Parsing
Who’s this book for?
This book has a number of target audiences. Perl scripting system administrators may gain some insight into improving parsing of command line options or configuration files. Scientists, teachers or students should find value in data manipulation and mining.
Relevance to free software
Since its inception in 1987, Perl has been free and has accumulated a particularly large following of addicted developers. This thoroughly RAD scripting language has been ported to possibly more platforms than Java. Pro Perl Parsing addresses one of its significant strengths, as Perl is well known as the main language for report generation and ad hoc data parsing.
This quick to script language has great influence on the proper running of many Linux servers. If you look around a typical server, you will find code for log rotation, report generation, general cleanup scripts and a number of command line tools. Even in this mundane everyday environment an understanding of Perl parsing is at the least handy.
Looking further afield, programming of data parsing and crunching is a fundamental skill for most scientists who coincidentally need to choose from proprietary or open languages or tools. Therefore, this book is helpful to the momentum of the free software movement at a number of levels.
Pros
Accurate and to the point, this book describes the parsing of data via CPAN modules correctly and without fuss. A good solid read and not too thick in pages to be off-putting.
Cons
If you are looking for a hard core Perl regex book then you may find this book light on the subject. However, you may be surprised by how relevant parsing is to your problem domain. Therefore, please read the table of contents before passing this one over.
| Title | Pro Perl Parsing |
| Author | Christopher M. Frenz |
| Publisher | Apress |
| ISBN | 1590595041 |
| Year | 2005 |
| Pages | 272 |
| CD included | No |
| FS Oriented | 8 |
| Over all score | 9 |
In short
Write a full post in response to this!
Similar articles
Do you like this post?
Vote for it!
Copyright information
This article is made available under the "Attribution" Creative Commons License 3.0 available from http://creativecommons.org/licenses/by/3.0/.
Biography
Alan Berg Bsc. MSc. PGCE, has been a lead developer at the Central Computer Services at the University of Amsterdam for the last eight years. In his spare time, he writes computer articles. He has a degree, two masters and a teaching qualification. In previous incarnations, he was a technical writer, an Internet/Linux course writer, and a science teacher. He likes to get his hands dirty with the building and gluing of systems. He remains agile by playing computer games with his kids who (sadly) consistently beat him physically, mentally and morally.
You may contact him at reply.to.berg At chello.nl
- Login or register to post comments
- 6360 reads
- Printer friendly version (unavailable!)




Best voted contents
-
Special 301: FOSS users. Now we're all Communists and Criminals
Gary Richmond, 2010-03-05 -
Microsoft's Internet Driving Licence: stupid, unworkable and unenforceable
Gary Richmond, 2010-03-10 -
The Bizarre Cathedral - 69
Ryan Cartwright, 2010-03-12 -
The Bizarre Cathedral - 68
Ryan Cartwright, 2010-02-19
Buzz authors
Free Software news
- @the_hindu : "Towards bridging the ubiquitous digital divide" by @deepakurup http://bit.ly/cQPgMp #Bangalore #freesoftware
- Henrique Bastos no FLOSS Weekly 108: http://bit.ly/9s6r86 Muito #python #horaextra #freesoftware #fun
- #GUADEC call for participation deadline - arriving fast! http://ur1.ca/pu6o #gnome #freesoftware
- #ubuntu #linux 6 ##Ubuntu 10.04 Lucid #Lynx Wallpapers http://goo.gl/fb/1MRl #freesoftware #livecd #softwareprojects
- Revision 379: [No log message]: Changed Paths: Modify /trunk/konno-freesoftware/get_title.py… http://goo.gl/fb/JCTT
Similar entries
Other sites
- The Top 10 Everything (Dave). The good, the bad and the ugly.
- Free Software news (Dave & Bridget). All about free software -- free as in freedom!
- Book Reviews: Illiterarty (Bridget). Book reviews, blogs, and short stories.
Hot topics - last 60 days
-
Linux performance: is Linux becoming just too slow and bloated?
Mitch Meyran, 2010-01-26 -
Web code is already open - why not make it free as well
Ryan Cartwright, 2010-01-20 -
Save "Sita Sings the Blues" from the Flash format: can you convert FLA?
Terry Hancock, 2010-01-29 -
Microsoft's Internet Driving Licence: stupid, unworkable and unenforceable
Gary Richmond, 2010-03-10 -
Question Copyright's "Minute Memes" challenge copyright rhetoric
Terry Hancock, 2010-01-15
Hot topics - last 21 days
-
Microsoft's Internet Driving Licence: stupid, unworkable and unenforceable
Gary Richmond, 2010-03-10 -
Special 301: FOSS users. Now we're all Communists and Criminals
Gary Richmond, 2010-03-05 -
The Bizarre Cathedral - 69
Ryan Cartwright, 2010-03-12 -
Interview: Nina Paley (author of "Sita Sings the Blues" and the two "Minute Meme" animations)
Terry Hancock, 2010-03-15
Odiogo
Free Software Magazine uses Apollo, project management and CRM for its everyday activities!

