Useful Links

Books

The workshop presentations inevitably assume some familiarity with content analysis methods and terminology. These books provide helpful overviews of the general approach as well as more specific elements.

  • Klaus Krippendorff (2004). Content Analysis: An introduction to its
    methodology, 2nd: Thousand Oaks, CA: Sage.
  • Kimberly A. Neuendorf (2002). The Content Analysis Guidebook. Thousand
    Oaks, CA: Sage.
  • Robert Philip Weber (1990). Basic Content Analysis, 2nd. Thousand Oaks,
    CA: Sage.
  • Bo Pang and Lillian Lee, Opinion Mining and Sentiment Analysis (2008). http://www.cs.cornell.edu/home/llee/omsa/omsa-published.pdf

Software

These software packages listed here are referenced in the workshop.

Software for preparing text as data

  • Outwit Hub is a free scraper utility for Mozilla. For the right tasks it works very well (register for the ‘Pro’ beta).
  • cURL retrieving internet files
  • Wget retrieving internet files
  • Will Lowe’s software page includes a number of helpful tools including ykconverter (converts files to raw text), jfreq (word counts) and yoshikoder (build unigram dictionaries).
  • tm (R package that includes tools for prepping text as data)
  • Excel offers some very sophisticated functions for manipulating text.  Here too.

Dictionary/NLP Resources

  • General Inquirer
  • Phil Schrodt’s PSEDP project (formerly KEDS)
  • WordNet
  • TextPack is a commercial text analysis package for analyzing word frequencies and developing and applying user constructed dictionaries. The student price is about $100.

Supervised Learning Resources

Data

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s