Parts of Speech Tagging N-Grams

ch03-tree-1Part of my thesis is to analyze a set of SMS messages using Parts of Speech Tagging. I wanted to see what POS would come up with given SMS messages to analyze. It was interesting. For the most part POS could successfully tag each of the SMS messages properly but was curious to see if there was a higher rate of a specific noun/verb/pronoun/etc usage in SMS messages. This post isnt about the outcome but rather a forum to place the POS N-grams out.

Some background
Before I could do anything I wanted to see if someone had already done this. I wanted to answer, Are there any N-grams for parts of speech tagging specifically for SMS text? Turns out I was either not looking in the correct place or there are non. So, I decided to create my own using the SMS data set located here: http://wing.comp.nus.edu.sg:8080/SMSCorpus/data/corpus/smsCorpus_en_xml_2012.04.30.zip

Creating the POS n-grams.
To create the POS n-grams I used the Treebank created by TweetNLP and then ran the output through some formatting of my own.

The files!
I now present you 2-gram, 3-gram, and 4-gram files based off of the SMS messages.

Download – 2 Gram
Download – 3 Gram
Download – 4 Gram

Armando Padilla

The challenge

It wasn’t until I left the meeting, came home, watched my wife put Amanda to sleep , and went into the home office that I started to think about what happened that morning and the conclusion that I came to. Having teams, divisions, companies reach goals while there are many negative factors is a high accomplishment.

I met an old coworker that morning heard the great things he was working on, the plans, the ambitious vision (which i do think he’ll complete), and thought to myself. I’m doing it wrong. And, What I wouldn’t give. I realized that the company he’s helped build operated with a different mentality and it was do to this mentality that it became and is successful. I guess you can say, I miss it.

Hitting reality
Reality. The reality of the situation though, my situation, is different. Though I wish my teams under me reach the point his entire company runs under reach and I reach the point where I think and plan like he does, I realize I’ve had success with a very critical component which he might never encounter do to the culture of the company he runs. How to reach goals, milestones, when not all pieces either cant or wont operate at 100% and you have no option but to keep the problems is a challenge.

So far, the formula i’ve implemented is and has shown results. Some believe it to be me putting in long hours (it’s not), or me doing the work for my teams (it’s not), it’s simply observing, looking at the process, and identifying pain points. Yes some times the pain points are personnel based but most of the time it’s not. Yes it does bring unwanted stress since I cant simply move the problem out of the situation like my friend can but it helps me work with what I have to the highest degree.

What am I getting at?
Working and delivering results in a non ideal environment is a great challenge that not many people can succeed at. Though I sometimes wish I had the wiggle room to remove continued problems knowing that results were attained under non ideal conditions is great.

The challenge? Not to get discouraged.

Armando Padilla