Category Archives: Top News

Download data to predict gender using first name (US data)

Do you have data with just first names or even just first initials but no information on the person's gender/sex? If you would like better insights on your customers, based on whether they are likely male or female, then this data download is a great way to maximize your ROI! Download it today and begin using it to tailor your messaging and improve future communications.

There are three licenses available for this data: individual, corporate and corporate for multi-company consumers. The individual version is available free (with discount code) for a limited time. Simply select the Individual license for purchase and use discount code discfreepers at the checkout page; this will deduct $3.99 from your purchase price.

The primary table in this data download is First names by Freakalytics, with 5,164 rows (distinct names and common misspellings). You can use this data to guess whether someone is male or female based on their first name, or to find the probability that they are male or female based on their first name.

Here is the column information and simple summaries for this table:

Data Column                 Max      Min     Average  Median  Mode
Name mixed case             Zulma    Aaron   N/A      N/A     James
Most likely gender          Male     Female  N/A      N/A     Female
Rank Overall                4,019    1       2,354    2,397   4,019
Male Probability            100%     0%      22%      0%      0%
Female Probability          100%     0%      78%      100%    100%
Count Either Gender         99,989   32      1,079    127     32
Male Count                  99,671   0       524      0       0
Female Count                83,718   0       555      64      32
Male Probability Within     3.68%    0.00%   0.08%    0.01%   0.00%
Female Probability Within   2.92%    0.00%   0.02%    0.00%   0.00%
Male Rank                   1,054    1       584      608     1,054
Female Rank                 3,052    1       1,825    2,131   3,052
Name first initial          Z        A       N/A      N/A     J
Name upper case             ZULMA    AARON   N/A      N/A     JAMES
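
To make the lookup idea concrete, here is a minimal Python sketch of how a table like this might be used. It assumes that Male Probability is simply Male Count divided by Count Either Gender, which the download does not confirm. The three sample rows are invented placeholders, not values from the Freakalytics data, and the function names are hypothetical.

```python
from typing import Optional, Tuple

# Placeholder rows, NOT taken from the Freakalytics download: the counts are
# invented purely so the sketch can run.
# Key: "Name upper case"; value: ("Male Count", "Female Count").
SAMPLE_COUNTS = {
    "ALEX": (900, 100),
    "DANA": (200, 800),
    "PAT": (500, 500),
}


def male_probability(name: str, counts=SAMPLE_COUNTS) -> Optional[float]:
    """Return the male share for this first name, or None if the name is unknown."""
    row = counts.get(name.strip().upper())
    if row is None:
        return None
    male, female = row
    total = male + female  # assumed to equal "Count Either Gender"
    if total == 0:
        return None
    return male / total


def guess_gender(name: str, counts=SAMPLE_COUNTS) -> Tuple[str, Optional[float]]:
    """Return (label, probability of that label).

    The label is "Unknown" when the name is missing or evenly split.
    """
    p_male = male_probability(name, counts)
    if p_male is None:
        return ("Unknown", None)
    if p_male > 0.5:
        return ("Male", p_male)
    if p_male < 0.5:
        return ("Female", 1.0 - p_male)
    return ("Unknown", 0.5)
```

Calling guess_gender on a customer's first name returns both the most likely label and its probability, so ambiguous names can be left out of gender-tailored messaging rather than guessed.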

The top few rows from this table (as a snapshot of the data in Excel 2003 format and in text):

[Image: snapshot of the top few rows of the table]

Access this valuable data download here.

Top 5 Stories Worth Reading in Data, Data Warehousing, Analytics & BI

February 23rd, 2014
Stephen McDaniel
Chief Data Officer Advisor at Freakalytics, LLC

Finding it hard to make time to keep up with the rapidly changing world of data, data warehousing, analytics, data science, business intelligence and visual analytics? We understand! Each week, I read through hundreds of stories in this space and share the five (okay, this week I couldn't stop and chose six) most worthwhile articles with you. Each article includes a summary and link to the full story. Please note that inclusion of an article does not indicate that I agree with every point in the article, but I at least find it thought-provoking and useful for informed debate.

 

 

Recruit Better Data Analysts

In the big data talent wars, most companies feel they’re losing. Marketing leaders are finding it difficult to acquire the right analytical talent. In the latest CMO Survey, only 3.4% of senior marketers believe they have the right talent. Business-to-business companies have a bigger gap than business-to-consumer companies, as do companies with a lower percentage of their sales coming from the internet. And yet analytic skill is a must for effective marketing.

Results indicate that companies with above-average marketing analytics talent experienced significantly greater rates of marketing return on investment (MROI) than companies with below average analytics talent…

 

Big Data doesn’t lie. Or does it?

If you’ve seen any indication that humans are getting smarter and more sophisticated, please inform me, and you don’t have to read what follows. For everyone else, who sees no lack of stupidity and misinformation in both business and public life, read on about the joys big data will bring.

The insights offered by analysis of big data are only as good as the human beings that create the data, gather and assemble it, decide what questions should be asked and how the data is presented. And interpreted, especially that.

I am very concerned that big data, misapplied and misunderstood, will create big lies. Or more likely a combination of lies and truths that prove very difficult to sort out.

 

What is Data Mining?

One way businesses can turn the information into something useful is through data mining. Data mining is a process used to analyze raw information to try and find useful patterns and trends in it.

"Data mining applications help users discover correlations and connections within large data sets," Software Advice writes on its website. "These might have gone unnoticed without these algorithms."

 

Scientific dashboard with periodic table of elements
Includes history, photos, great filters, discoverers and more

There is nothing we love more than sharing great examples of what is possible with visual analytics and dashboards. It’s one of the best ways to inspire new analysts and expand people’s horizons of the nearly infinite number of ways that visual analytics tools like Tibco Spotfire can be used.

Continue reading

Top News – Analysis & Commentary
Data, Data Warehousing, Analytics & BI

Five Business Intelligence Predictions
from Paxata for 2014

January 26th, 2014
Stephen McDaniel
Chief Data Officer Advisor at Freakalytics, LLC

Finding it hard to make time to keep up with the rapidly changing world of data, data warehousing, analytics, data science, business intelligence and visual analytics? We understand! Here’s a top new story worth reading, which we considered noteworthy enough to warrant commentary and analysis by Freakalytics (in purple). A summary of the article and the excerpts that I comment on are in black.

In this commentary and analysis, we cover the growth of Tableau and QlikView, the opportunities that exist for Microsoft to disrupt the second-generation business intelligence market, and how self-service data integration will likely make data scientists and data enthusiasts much more productive, enabling wide swathes of Accidental Analysts to quickly answer tactical business questions.

Five Business Intelligence Predictions For 2014 (from the CEO of Paxata)
Summary
The dust is finally beginning to clear from the big data explosion, which is a good thing. One of the problems with big data is that it’s been led by technology, not business requirements. And business requirements will be the focus in the 2014 business intelligence (BI) ecosphere—to enable enterprises to achieve results with data mining and analytics and to prove those results.

Stephen
I found this article a fascinating glimpse into the strategic thoughts of the CEO of a promising, second-generation, cloud-based data integration company, Paxata.

Continue reading

Top 5 Stories Worth Reading in
Data, Data Warehousing, Analytics & BI

January 26th, 2014
Stephen McDaniel
Chief Data Officer Advisor at Freakalytics, LLC

Finding it hard to make time to keep up with the rapidly changing world of data, data warehousing, analytics, data science, business intelligence and visual analytics? We understand! Each week, I read through hundreds of stories in this space and share the five most worthwhile articles with you. Each article includes a summary and link to the full story.

 

How Netflix Got Analytics Wrong, Then Right
Two entertaining and informative articles about Netflix illustrate how to be smart about using analytics. In one instance we see where Netflix went wrong, and in another we see Netflix doing the right thing.

Netflix launched a high-profile crowdsourcing project in 2006 to develop a better recommendation engine, offering a $1 million prize to any person or team who could improve Netflix recommendations by a modest 10 percent.

 

Signs That Your BI Dashboard Needs a Comeback

Everyone loves a good comeback. Stories about celebrities like Robert Downey Jr. and Britney Spears climbing back to the top after falling so far capture our collective imagination. Movies like Rocky and Cinderella Man – about underdogs making a comeback – inspire us to think we ourselves can rebound from any setback.

Is your business intelligence dashboard the underdog at your organization? Dashboards have been around for decades, with some companies not putting the time and effort into updating them regularly to keep pace with the innovations in BI and the growing expectations of users.

 

"Pitch Perfect" And How Analytics Are Transforming Movie Marketing

When Universal released the cult musical film Pitch Perfect in 2012, they did what any self-respecting studio would do: They commissioned marketing reports and forecasted ticket sales for the Anna Kendrick-starring movie. Among them was an analysis by a company called Fizziology, which data-mines social media to see how the film would play out with audiences.
Continue reading

2014 US growth forecasts of Business Intelligence vendors

There are many ways of measuring the growth of business intelligence vendors. One approach of interest in the era of self-service analytics is to measure the growth in web search volume. Derived from web search volume data from Google, the following analyses can serve as a useful reference to understand which companies/products are growing in popularity and which may be falling out of favor.

The estimates in all of the following analyses are based on simple web search volume indices from the United States through the end of November 2013. Using historic search volume data, forecasts were built for each company/product, and growth rates for 2014 were derived from these forecasts.
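
As a rough illustration of how a growth rate can be derived from a search-volume forecast, here is a minimal Python sketch. The forecasting method used for these analyses is not stated, so the straight-line trend below is an assumption chosen for simplicity, and the function names are hypothetical. It fits a least-squares line to a monthly index, projects the next 12 months, and compares the projected average with the average of the last 12 observed months.

```python
from statistics import mean
from typing import List, Sequence, Tuple


def linear_trend(values: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of values against 0, 1, 2, ...; returns (slope, intercept)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    xs = range(n)
    x_bar = mean(xs)
    y_bar = mean(values)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, values))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def forecast(values: Sequence[float], periods: int) -> List[float]:
    """Extend the fitted line `periods` steps past the last observation."""
    slope, intercept = linear_trend(values)
    n = len(values)
    return [intercept + slope * (n + k) for k in range(periods)]


def forecast_growth_rate(values: Sequence[float], periods: int = 12) -> float:
    """Projected average over the next `periods` months relative to the
    average of the last `periods` observed months, minus 1."""
    if len(values) < periods:
        raise ValueError("need at least `periods` observations")
    baseline = mean(values[-periods:])
    projected = mean(forecast(values, periods))
    return projected / baseline - 1.0
```

A result of 0.25 would read as 25% forecast growth. A straight line ignores seasonality, so a real analysis would likely need a richer model than this sketch.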

I would group these companies into three categories:
fast growers: Tableau, PowerPivot, QlikView, BIRST and GoodData;
growers: Spotfire and MicroStrategy;
and mature products: Oracle BI, SAS, Cognos, SPSS, SQL Server, Actuate and Business Objects.
Continue reading

Top News – Data, Data Warehousing, Analytics & BI

November 18th, 2013
Stephen McDaniel
Chief Data Officer Advisor at Freakalytics, LLC

Finding it hard to make time to keep up with the rapidly changing world of data, data warehousing, analytics, data science, business intelligence and visual analytics? We understand! Here’s our curated summary of relevant news that could help with your future data and analytic projects. We also add commentary on the topic, a summary of the article and the link to read the full article.

There are four articles in this update:
     Amazon wades into big data streams with Kinesis
     Top 10 Trends in Text Analytics
     Effective Customer Analytics Call for Data Integration, Culture Shifts
     Your Car Is a Data Platform, What Can It Tell About You?

Missed our last issue of Top News, November 15th? Stories included RapidMiner (free and premium data mining), big data not top CFO priority, the DATA Act passes Senate, SAS replacing PowerPoint and big data sources to consider at your company.
Amazon wades into big data streams with Kinesis

Amazon adds another layer to its data storage and streaming options: Kinesis. Kinesis is all about real-time data collection and aggregation in a hosted cloud, scalable from megabytes to terabytes per hour! As such, it is a service that keeps your data for a maximum of 24 hours, by which time you have presumably used it or stored it in a data warehouse (like Amazon Redshift), Hadoop system (like Amazon Elastic MapReduce), NoSQL system (like Amazon DynamoDB) or file store (like Amazon S3)! Do you notice a trend here?
Continue reading

Top News – Data, Data Warehousing, Analytics & BI

November 15th, 2013
Stephen McDaniel
Chief Data Officer Advisor at Freakalytics, LLC

Finding it hard to make time to keep up with the rapidly changing world of data, data warehousing, analytics, data science, business intelligence and visual analytics?  We understand! Here’s our curated summary of relevant news that could help with your future data and analytic projects. We also add commentary on the topic, a summary of the article and the link to read the full article.

There are five articles in this update:
     Rapid-I data mining now RapidMiner, the Red Hat of data mining?
     Integrating data and mobile trumps big data for many CFOs
     Bipartisan DATA Act unanimously approved by Senate Committee
     Can SAS Visual Analytics replace PowerPoint?
     Big data sources to consider for your company

Missed our last issue of Top News, November 11th? Stories included Big Data and Society, Data Mining Blues, 2014 INFORMS Conference, Facebook's Free Big Data System for Analysts, Adaptive Data Preparation, How Trust Affects the Use of Analytics and Meeting a VAST Challenge.
Rapid-I data mining now RapidMiner,
the Red Hat of data mining?

German predictive analytics, data mining and text mining company receives $5M in funding and announces a planned move of its headquarters to Boston from Dortmund, Germany. I would liken it to the Red Hat of data mining, with a free community edition and paid corporate editions that add support, more data sources and more capabilities. With over 3 million downloads, 20,000 deployments and 400 paid customers including eBay, Intel, PepsiCo and Kraft, you may want to consider RapidMiner for your advanced analytics projects. The 2013 KDnuggets poll showed RapidMiner’s free edition ahead of every other advanced analytics choice, including R.
Continue reading

Top News – Data, Data Warehousing, Analytics & BI

November 11th, 2013
Stephen McDaniel
Chief Data Officer Advisor at Freakalytics, LLC

Finding it hard to make time to keep up with the rapidly changing world of data, data warehousing, analytics, data science, business intelligence and visual analytics? We understand! Here's our curated summary of relevant news that could help with your future data and analytic projects. We also add commentary on the topic, a summary of the article and the link to read the full article.

There are seven articles in this update:
     How Big Data Is Changing Science (and Society)
     Big data blues: The dangers of data mining
     2014 INFORMS Conference on the Business of Big Data
     Facebook System for Massive Big Data (Hadoop FS) Offered Free to World
     Paxata Launches Industry’s First Adaptive Data Preparation Platform
     C-Suite and Trust Both Affect Financial Returns on Analytics, Big Data
     Meeting a VAST challenge – Lincoln Laboratory staff create winning visualization
How Big Data Is Changing Science (and Society)

Traditional statistical approaches that long dominated scientific research are being challenged and augmented by new approaches from the fields of big data and data science.

HOW CAN YOU PREDICT something without understanding it? Simple:
Continue reading