Blogs

  • News
  • |
  • Art
  • |
  • History
  • |
  • Food and Travel
  • |
  • Science
SmartNews

Keeping You Current

Around the Mall

Scenes and sightings from Smithsonian museums and beyond


August 27, 2012 10:45 am

Data Mining the Classics Clusters Women Authors Together, Puts Mellville Out On a Raft

Photo: poetas

Can computers analyze literature? Depends on who you ask. Some literary types are taking offense at a new statistical tool that mines and clusters classic works. But it was one of their own – English professor Matthew Jockers of the University of Nebraska-Lincoln – who devised the new supercomputer-mediated literary analysis. Jockers’ macroanalysis method compares thousands of books in order to identify systems of influence, school of thought or other groupings that human scholars might have missed.

“We need to go beyond our traditional practice of close reading and go out to a different scale,” he told NBC News. “The traditional practice of close reading allows us to look at the bark on the trees, while the macroanalytic allows us to see the whole forest.”

Jockers analyzed thousands of books from the late 18th to 19th centuries for their punctuation, word choice and overarching theme. The results give him a “book signal” to allow each work to be compared and plotted alongside others. Melville, apparently, warrants his own aquatically-themed cluster.

Photo: Matthew Jockers / University of Nebraska-Lincoln

A few patterns emerged. Female authors, for example, were grouped together although the computer did not take their gender into account when placing them. This shows that, on a whole, female authorship is detectable by objective measures rather than just human intuition.

The darker-colored areas represent groups of women authors. Photo: Matthew Jockers / University of Nebraska-Lincoln

While some scholars feel threatened by the new method, Jockers points out that his high level approach could lend new perspective and prompt fresh investigation into the classics and other literary works. And while his analysis reveals trends such as the female authorship clustering, it does not tease out some intricacies better left to human minds. For example, a few of the best-known works by women, like Jane Austen’s greatest hits, were not nestled in the female-clustered group. Pointing that out and examining the meaning behind it is a job best done by humans, he says.

More from Smithsonian.com:


***

Sign up for our free email newsletter and receive the best stories from Smithsonian.com each week.

No Comments »

No comments yet.

RSS feed for comments on this post. TrackBack URI

Leave a comment

Comments are moderated, and will not appear until Smithsonian.com has approved them. Smithsonian reserves the right not to post any comments that are unlawful, threatening, offensive, defamatory, invasive of a person's privacy, inappropriate, confidential or proprietary, political messages, product endorsements, or other content that might otherwise violate any laws or policies.

Spam protection by WP Captcha-Free

Advertisement



Trending Today New Research Cool Finds

Follow Us

Travel with Smithsonian






Advertisement