Data Mining

  • Most Topular Stories

  • Forget Murder, Gun Ownership More Related to Suicides/Accidents

    Data Science Notes
    26 Aug 2015 | 9:19 pm
    Another day, and another shooting is the top news story.  The shootings in the news constantly inspired me to download gun and shooting data, and what I found was highly surprising, yet depressing.As a quick piece of personal background, I know two people who have been shot in the last 18 months.  And it's not like I am living a hardcore life to know people who have been shot. I'm an analytics manager at a financial company. I am also a gun owner. BACKGROUNDI've looked at mass shooting data before, but mainly just the twitter conversations in the weeks after the shootings.
  • Big Bigger Biggest Data in Oil & Gas

    TIBCO Spotfire's Trends and Outliers
    Cloudera
    27 Aug 2015 | 7:46 am
    For decades the oil & gas industries have used advanced analytics to attempt to understand the earth’s subsurface. And now with the advent of ‘Big Data’ and the ‘Internet of Things’ the Oil & Gas Industries are primed to make revolutionary advances in how they use data to manage and grow their business. The first wave of ‘Big Data’ was more or less about people communicating with other people—email, chat, social media, video, etc. This next wave is really driven by much of the work being done in oil & gas will be a tsunami in comparison, as it’s driven…
  • Tips for Better Content Marketing Strategy with Data Visualization

    TIBCO Spotfire's Trends and Outliers
    Spotfire Blogging Team
    24 Aug 2015 | 6:27 am
    In a world driven by data, marketers are finding new and better ways to present data to make it more visually appealing—and this data visualization strategy is invaluable to content marketing approaches because it makes the data easily consumable, notes Tony Adragna in an article in Business 2 Community. Adragna offers some tips to help marketers better use data visualization in their content strategies: 1. Only Use Accurate Data—If you want to get the appropriate insights from your data, you have to ensure that your data reliable and you can trust it. That means you have to carefully…
  • Playing With Numbers: A Fantasy Football Tip

    The Numbers
    Brian Hershberg
    19 Aug 2015 | 10:27 am
    With the first batch of the National Football League's preseason games in the books, it's that time for every armchair general manager to begin assessing the draft, keepers and a strategy for the season. Thanks to a new tool, we can incorporate injury risk into our planning.
  • Congressional Candidates Wasted $240 Million on TV in 2014

    Latest articles from Direct Marketing News
    Direct Marketing News
    27 Aug 2015 | 12:47 pm
    It's difficult for politicians to be direct about anything, but the smart ones will use direct methods to wring value from their marketing budgets in 2016.
 
  • add this feed to my.Alltop

    The Numbers

  • Playing With Numbers: A Fantasy Football Tip

    Brian Hershberg
    19 Aug 2015 | 10:27 am
    With the first batch of the National Football League's preseason games in the books, it's that time for every armchair general manager to begin assessing the draft, keepers and a strategy for the season. Thanks to a new tool, we can incorporate injury risk into our planning.
  • Reader Mailbag: Crime and Employment

    Jo Craven McGinty
    17 Aug 2015 | 10:34 am
    Worker advocates who support ban-the-box initiatives aimed at removing questions about criminal history from job applications fear the information will prejudice employers against capable employees who may have run afoul of the law. Dave's Killer Bread welcomes workers with rap sheets.
  • Behind The Numbers: Some Long Odds

    Jo Craven McGinty
    14 Aug 2015 | 10:12 am
    What are the odds?
  • Behind The Numbers: Crime Data

    Jo Craven McGinty
    7 Aug 2015 | 8:38 am
    According to multiple studies, roughly 1 in 3 American adults has been arrested or taken into police custody for something other than a minor traffic offense. But none of the data sets is complete, and some include duplicate or defunct records.
  • 50 Cent Bankruptcy: By The Numbers

    Katy Stech
    4 Aug 2015 | 7:17 am
    50 Cent filed 56 pages of financial information that’s required as part of his bankruptcy case. Find out whether the 40-year-old entertainer, whose real name is Curtis James Jackson III, is really as rich as he claims to be.
  • add this feed to my.Alltop

    Kevin Hillstrom: MineThatData

  • Retail Interest Payments

    Kevin Hillstrom
    27 Aug 2015 | 8:10 pm
    Have you ever looked at the profit and loss statement of your favorite retail brand?Click here and take a look at Macy's.In the past five years, they've paid $2,200,000,000 (yes, 2.2 billion) in interest ... INTEREST!Think of it this way ... every time you waltz into your favorite Macy's and spend $100, banks get to take $1.40 right off the top ... that's how the math works. Heck, with a gross margin in the 36% range, the math is even more extreme. When you spend $100, Macy's gets to keep $36 ... and then the banks take $1.40 off the top of the $36 that remain, or 4%.That, my friends, is what…
  • Burning Man & Home Depot

    Kevin Hillstrom
    26 Aug 2015 | 8:10 pm
    This reads a lot like a paid ad, of course, but what the heck, click here.Now, if you worship at the altar of personalization, then this is for you ... no, not the "Dear Kevin, Here Is 20% Off Because You Previously Bought Widgets" digital nonsense that some demand you adopt because some make money when you adopt their ethos.No, this is merchandise personalization.In an omnichannel world, you are told you must have the same products at the same prices across all channels. Sameness. You are told that "the customer" demands it.In the real world, you run a local shop. You tailor the assortment…
  • Authentic And Meaningful Relationships

    Kevin Hillstrom
    26 Aug 2015 | 8:56 am
    Chief Merchant is hosting a webinar about how the Atlanta Falcons. One of the sentences is particularly meaningful ... "where brands are creating authentic and meaningful relationships." More on that in a moment.Let's look at Atlanta's record the past two seasons:2013 = 4 wins, 12 losses.2014 = 6 wins, 10 losses.In other words, the merchandise is failing. The team has won 31% of their games over the past two seasons ... following five consecutive highly successful, winning seasons. That's a difficult transition ... anybody who loves a sports team knows what this feels like (#notgood).So, the…
  • When Marketing And Testing Interact - Panera Bread

    Kevin Hillstrom
    25 Aug 2015 | 8:10 pm
    This is always tough ... marketing wants to perform a marketing tactic, and somebody else wants to test a strategy (often the marketing team).This image represents a marketing tactic ... use your mobile device (#mobilefirst) to order food ahead of time, and receive $5 off for choosing this tactic.In a marketing world where you don't test anything, this is fine. The tactic works, or the tactic does not work.In the real world, this tactic decomposes into a series of questions that somebody will want answered.Will a customer switch to mobile once awareness of the channel has been achieved?Does…
  • When Merchandise And Data Driven Logic And Technology Meet

    Kevin Hillstrom
    24 Aug 2015 | 8:10 pm
    Two stories to enhance your Tuesday experience.Story #1:  A check engine light comes on. The computer suggests that there is a problem with my Diesel Exhaust Fluid. That's curious, because 30 miles earlier, I drove the vehicle away after an oil change that included 2.5 gallons of Diesel Exhaust Fluid being added to a 4.0 +/- gallon tank. This vehicle has 4,100 miles on it ... so whew, it's covered under the warranty, right? So I take it to the dealer. The check engine code is read. Here's the discussion that follows.Dealer: Mr. Hillstrom, you may have voided your warranty.Kevin:…
 
  • add this feed to my.Alltop

    TIBCO Spotfire's Trends and Outliers

  • Big Bigger Biggest Data in Oil & Gas

    Cloudera
    27 Aug 2015 | 7:46 am
    For decades the oil & gas industries have used advanced analytics to attempt to understand the earth’s subsurface. And now with the advent of ‘Big Data’ and the ‘Internet of Things’ the Oil & Gas Industries are primed to make revolutionary advances in how they use data to manage and grow their business. The first wave of ‘Big Data’ was more or less about people communicating with other people—email, chat, social media, video, etc. This next wave is really driven by much of the work being done in oil & gas will be a tsunami in comparison, as it’s driven…
  • 13th Annual Houston Energy Forum SOLD OUT!

    Spotfire Blogging Team
    26 Aug 2015 | 5:55 am
    THE 2015 TIBCO ENERGY FORUM is now sold out!! On September 1 and 2, TIBCO will bring together industry luminaries from all segments of the energy industry for our 13th Annual Houston Energy Forum where attendees will network, listen to industry thought leaders, and learn best practices directly from TIBCO customers, partners and subject matter experts. This must-attend event, taking place at the Norris Conference Center near downtown Houston, will spotlight executive speakers, a wide range of customer presentations, and breakout sessions featuring real-world use cases and best practices…
  • Tips for Better Content Marketing Strategy with Data Visualization

    Spotfire Blogging Team
    24 Aug 2015 | 6:27 am
    In a world driven by data, marketers are finding new and better ways to present data to make it more visually appealing—and this data visualization strategy is invaluable to content marketing approaches because it makes the data easily consumable, notes Tony Adragna in an article in Business 2 Community. Adragna offers some tips to help marketers better use data visualization in their content strategies: 1. Only Use Accurate Data—If you want to get the appropriate insights from your data, you have to ensure that your data reliable and you can trust it. That means you have to carefully…
  • How SumAll.org Puts Analytics in Good Hands

    Ann Scheuerell
    20 Aug 2015 | 6:22 am
    The right data can help nonprofits in two major ways. It can assist with funding by illustrating to people how much help is needed, and it can support developing more effective strategies for achieving goals. “At SumAll.org, we’re working on making enterprise-scale problem-solving accessible to nonprofits. When a solution is easy to use and flexible, it can have transformational power,” says Stefan Heeke, SumAll.org executive director. Though nonprofits want data, they don’t always have the ability to get it. “When we start a project to help a nonprofit, we sometimes see a…
  • Big Data’s Companion: Wide Data

    Spotfire Blogging Team
    19 Aug 2015 | 7:40 am
    If you’ve worked with Big Data and analytics for long, you’re probably aware of the challenges of mixing structured and unstructured data. However, there are powerful insights that can be gathered when mixing these two types of data—sometimes called Wide Data. What Is Wide Data? Imagine you have enterprise-structured data on SQL servers that contain your organization’s internal information about each of your clients. However, you also have emails, social media (Facebook, LinkedIn, Twitter, YouTube, etc.), Word docs, audio files, and more that also pertain to your clients. This…
 
  • add this feed to my.Alltop

    PolicyMap

  • Understanding FEMA Disaster Declarations

    Morgan Robinson
    26 Aug 2015 | 2:11 pm
    Natural disasters are “in” right now. Hurricane season is picking up, people in New Orleans and across the globe are commemorating the tenth anniversary of Hurricane Katrina, and residents of the West Coast are growing increasingly anxious about the next big earthquake. A recent feature in the New York Times visualizes the physical and economic impact of natural disasters throughout the country. We’ve added the locations of federally-declared disaster areas to PolicyMap, and we hope this new data makes it easier to look at natural and man-made disasters in a policy context.
  • Low Income Housing Tax Credit (LIHTC) Update

    Aaron King
    21 Aug 2015 | 1:49 pm
    The Low-Income Housing Tax Credit (LIHTC) program is a federal program that has been implemented over the last three decades to encourage the acquisition, rehabilitation, and new construction of rental housing targeted to lower-income households. Since its inception in 1986, LIHTC has contributed to the leveraging of nearly $100 billion in private investment capital, resulting in the financing of almost 2.8 million housing units. It is one of the nation’s most critical tools for creating and rehabilitating affordable housing. LIHTC was originally established by the U.S. Internal Revenue…
  • It’s a Papal Party in the USA

    Aaron King
    18 Aug 2015 | 2:27 pm
    Philadelphia is abuzz in anticipation of Pope Francis I’s September visit to the United States. Pope Francis is visiting the city as part of a six-day trip to the United States, culminating in the World Meeting of Families in Philadelphia on September 26th-27th. In recent months the news has been about the logistical aspects of the Pope’s visit as the estimated number of religious pilgrims to visit Philadelphia is expected to exceed almost 1.5 million people. One notable logistical point is that many of the roads going in and out of Philadelphia’s Center City and the adjoining West…
  • Congratulations and Good Luck, Katie Nelson!

    Maggie McCullough
    13 Aug 2015 | 7:45 am
    I cannot believe this is Katie’s last full week with us.  I cannot believe we won’t see her smiling, intelligent face every day here at PolicyMap.  I cannot believe I actually offered to write this blog post… Katie Nelson – one of our earliest hires at PolicyMap – is leaving us.  Sure, she is leaving us to attend Rutgers University’s Bloustein School to earn her PhD.  Sure, she received a full scholarship.  Sure, it is an amazing path for her future.  But we can’t help ourselves.  We behave like small children when we whine “what about us?” Katie has been instrumental…
  • Color Me Curious: Why PolicyMap Maps are Purple

    Bridget Kane
    10 Aug 2015 | 2:53 pm
    Data visualization is a powerful tool—colors can provide very meaningful context, but they can just as easily make for a very misleading representation of your information. There are a few things to think about when choosing colors to symbolize your data, namely, color properties, connotations, and relationships to other objects or features on the ground. Munsell’s cylindrical arrangement of colors: the horizontal line represents chroma (saturation), the vertical line represents value (lightness) and the circle represents hue. The technical portion of color choice deals with how colors…
  • add this feed to my.Alltop

    Revolutions

  • Because it's Friday: Downtown Spokane

    David Smith
    28 Aug 2015 | 12:20 pm
    There hasn't really been a break-out Summer hit this season, but Macklemore's latest — an ode to mopeds, of all things — might just be it (some language NSFW): Loving the 70's styling and dance moves. And despite the Seattle-focused lyrics, Spokane makes a great backdrop for the music video. That's all for this week. See you back here on Monday!
  • Get to know Cortana Analytics: Workshop and webinars

    David Smith
    28 Aug 2015 | 8:12 am
    Cortana Analytics Suite is Microsoft's cloud-based big data and advanced analytics suite. It includes a complete set of all the services need to build advanced analytics applications: from data ingestion and management, data warehousing, advanced analytics, data visualization and solution frameworks. You can use Cortana Analytics to build applications using R, by incorporating services including Data Factory, HDInsights Hadoop, and ML Studio. If you'd like to spend some quality time with other developers and the Microsoft development team, there will be a first-ever Cortana…
  • Plotting Time Series in R using Yahoo Finance data

    Joseph Rickert
    27 Aug 2015 | 8:30 am
    by Joseph Rickert I recently rediscovered the Timely Portfolio post on R Financial Time Series Plotting. If you are not familiar with this gem, it is well-worth the time to stop and have a look at it now. Not only does it contain some useful examples of time series plots mixing different combinations of time series packages (ts, zoo, xts) with multiple plotting systems (base R, lattice, etc.) but it provides an instructive, historical perspective that illustrates the non linear nature of progress in software development: new code is written to solve certain technical problems with the…
  • New R titles available in Chinese

    Andrie de Vries
    26 Aug 2015 | 7:00 am
    by Andrie de Vries A few days ago I received an unexpected parcel in my letterbox. To my delight, it turned out to be a translation into Simplified Chinese of R for Dummies, co-authored by myself and Joris Meys. Let me clarify: Joris and I wrote the book, but were not involved in the translation at all. The Wiley publishing machine took care of all of that. Just to prove that this is real, here is a picture of my copy, unfortunately slightly damaged during the delivery process: I find the decision by Wiley to translate into Chinese very interesting, since it means there is a growing market…
  • Following up on news stories with choroplethr and R

    Joseph Rickert
    25 Aug 2015 | 8:30 am
    by Ari Lamstein, consultant specializing in software engineering and data analysis and author of the free email course Learn to Map Census Data in R One of my favorite things about R is that it allows me to follow up on interesting news stories. Consider this interview on EconTalk about the history of fracking in America. Russ Roberts interviewed Gregory Zuckerman about his book The Frackers. One thing that struck me were the stories of how North Dakota is being transformed by the fracking boom. North Dakota sits on the Bakken formation which, due to fracking, is now able to be monetized.
  • add this feed to my.Alltop

    Data Science Notes

  • Kansas Election Fraud, Part 4

    28 Aug 2015 | 1:44 pm
    I was minding my business yesterday afternoon, when I was approached by someone on the internet with data I might be interested in.  I was half-expecting Ashley Madison or other scandalous data, but what I was presented with, was much, much juicier: election data.  Kansas Governor's Race 2014.  This is interesting data, given the current debate over election fraud in Kansas.  Also, for some reason, this data has not yet been posted to the Secretary of State's website (I would soon find out it was submitted by counties in poor formats, so maybe this is explainable).If…
  • Forget Murder, Gun Ownership More Related to Suicides/Accidents

    26 Aug 2015 | 9:19 pm
    Another day, and another shooting is the top news story.  The shootings in the news constantly inspired me to download gun and shooting data, and what I found was highly surprising, yet depressing.As a quick piece of personal background, I know two people who have been shot in the last 18 months.  And it's not like I am living a hardcore life to know people who have been shot. I'm an analytics manager at a financial company. I am also a gun owner. BACKGROUNDI've looked at mass shooting data before, but mainly just the twitter conversations in the weeks after the shootings.
  • Comparing Sentiment: #ksleg v #ksed v #Royals

    25 Aug 2015 | 1:14 pm
    Yesterday's post was fairly popular (quite a few page views) so I thought I would extend it a bit.  What about comparing sentiments over various topics rather than over time?  It's easy to point my code at three separate topics on twitter, use my emotion/polarity identifying algorithm on each set, and compare the results.  I also pull out a couple of current events references below.METHODOLOGYTechnology-wise, I'm using the same methodology I've implemented for my past few posts.For practical methodology, I pointed the algorithm at three separate hashtags that are used quite a…
  • Sentiment Mining #Royals Over Time

    24 Aug 2015 | 2:32 pm
    I have been analyzing political/policy data over the past few weeks using my sentiment mining algorithm, and thought it would be interesting to look at some sports Twitter data.  Fortunately, the Royals are good again this year, and approaching the end of the season, so I have a good local target.  A few pre-findings of interest:Royals tweets are much more positive than political tweets in general.Royals fans are most negative DURING games, and more positive when games are not being played.Royals fans are also less "sad" during games, though more likely to be angry or disgusted when…
  • #ksleg Friday Twitter Power Rankings

    22 Aug 2015 | 7:04 am
    And it's that time of the week again for the formulaic #ksleg power rankings.  I think I can almost automate this completely at this point, but here are our three steps:TOPIC MININGWhat topics were discussed this week?  Here's a word cloud to get us started:This was a relatively low-volume week on the #ksleg hashtag, and our topic mining only found a few distinct topics:Topic 1: Kansas schools seeking more aid for education, and having to document their efficiency.Topic 2: A former employee of the Kansas Secretary of State's office was reportedly fired for not attending church.Topic…
 
  • add this feed to my.Alltop

    Data Science Notes

  • Kansas Election Fraud, Part 4

    28 Aug 2015 | 1:44 pm
    I was minding my business yesterday afternoon, when I was approached by someone on the internet with data I might be interested in.  I was half-expecting Ashley Madison or other scandalous data, but what I was presented with, was much, much juicier: election data.  Kansas Governor's Race 2014.  This is interesting data, given the current debate over election fraud in Kansas.  Also, for some reason, this data has not yet been posted to the Secretary of State's website (I would soon find out it was submitted by counties in poor formats, so maybe this is explainable).If…
  • Forget Murder, Gun Ownership More Related to Suicides/Accidents

    26 Aug 2015 | 9:19 pm
    Another day, and another shooting is the top news story.  The shootings in the news constantly inspired me to download gun and shooting data, and what I found was highly surprising, yet depressing.As a quick piece of personal background, I know two people who have been shot in the last 18 months.  And it's not like I am living a hardcore life to know people who have been shot. I'm an analytics manager at a financial company. I am also a gun owner. BACKGROUNDI've looked at mass shooting data before, but mainly just the twitter conversations in the weeks after the shootings.
  • Comparing Sentiment: #ksleg v #ksed v #Royals

    25 Aug 2015 | 1:14 pm
    Yesterday's post was fairly popular (quite a few page views) so I thought I would extend it a bit.  What about comparing sentiments over various topics rather than over time?  It's easy to point my code at three separate topics on twitter, use my emotion/polarity identifying algorithm on each set, and compare the results.  I also pull out a couple of current events references below.METHODOLOGYTechnology-wise, I'm using the same methodology I've implemented for my past few posts.For practical methodology, I pointed the algorithm at three separate hashtags that are used quite a…
  • Sentiment Mining #Royals Over Time

    24 Aug 2015 | 2:32 pm
    I have been analyzing political/policy data over the past few weeks using my sentiment mining algorithm, and thought it would be interesting to look at some sports Twitter data.  Fortunately, the Royals are good again this year, and approaching the end of the season, so I have a good local target.  A few pre-findings of interest:Royals tweets are much more positive than political tweets in general.Royals fans are most negative DURING games, and more positive when games are not being played.Royals fans are also less "sad" during games, though more likely to be angry or disgusted when…
  • #ksleg Friday Twitter Power Rankings

    22 Aug 2015 | 7:04 am
    And it's that time of the week again for the formulaic #ksleg power rankings.  I think I can almost automate this completely at this point, but here are our three steps:TOPIC MININGWhat topics were discussed this week?  Here's a word cloud to get us started:This was a relatively low-volume week on the #ksleg hashtag, and our topic mining only found a few distinct topics:Topic 1: Kansas schools seeking more aid for education, and having to document their efficiency.Topic 2: A former employee of the Kansas Secretary of State's office was reportedly fired for not attending church.Topic…
Log in