Data Science eBook - 2nd Edition (tentative Table of Content)
Introduction
Part I - Data Science Recipes
- New random number generator: simple, strong and fast
- Lifetime value of an e-mail blast: much longer than you think
- Two great ideas to create a much better search engine
- Identifying the number of clusters: finally a solution
- Online advertising: a solution to optimize ad relevancy
- Example of architecture for AaaS (Analytics as a Service)
- Why and how to build a data dictionary for big data sets
- Hidden decision trees: a modern scoring methodology
- Scorecards: Logistic, Ridge and Logic Regression
- Iterative Algorithm for Linear Regression
- Approximate Solutions to Linear Regression Problems
- Theorems for Traders
- Preserving metric and score consistency over time and across clients
- Advertising: reach and frequency mathematical formulas
- Real Life Example of Text Mining to Detect Fraudulent Buyers
- Discount optimization problem in retail analytics
- Sales forecasts: how to improve accuracy while simplifying models?
- How could Amazon increase sales by redefining relevancy?
- How to build simple, accurate, data-driven, model-free confidence i...
- Comprehensive list of Excel errors, inaccuracies and use of non-sta...
- 10+ Great Metrics and Strategies for Email Campaign Optimization
- 10+ Great Metrics and Strategies for Fraud Detection
- Case Study: Four different ways to solve a data science problem
- Case Study: Email marketing - analytic tips to boost performance b...
- Optimize keyword campaigns on Google in 7 days: an 11-step procedure
- How do you estimate the proportion of bogus accounts on Facebook?
- Stat models to solve astronomical mysteries - application to business data
- How to detect a pattern? Problem and solution
- From chaos to clusters - statistical modeling without models
- Simple solutions to make videos with R
- Three classes of metrics: centrality, volatility, and bumpiness
- How to optimize email campaigns? Part I
- Simple steps to increase speed of web crawling by a factor 80,000
- How are database joins optimized? How can you do better to handle big data?
- Correlation vs. causation
- Seven tricky sentences for NLP and text mining algorithms
- Great statistical analysis: forecasting meteorite hits
- Fast clustering algorithms for massive datasets
- How are hotel room rates determined
- The curse of big data
- Simple technique to improve poor predictive models
- Simple source code to simulate nice cluster structures
- Source code for our Big Data keyword correlation API
- Correlation vs. causation
- Shootings stars: producing videos about data
Part II - Data Science Discussions
- Statisticians Have Large Role to Play in Web Analytics (AMSTAT inte...
- Future of Web Analytics: Interview with Dr. Vincent Granville
- Connecting with the Social Analytics Experts
- Interesting note and questions on mathematical patents
- Big data versus smart data: who will win?
- Creativity vs. Analytics: Are These Two Skills Incompatible?
- Barriers to hiring analytic people
- Salary report for selected analytical job titles
- Are we detailed-oriented or do we think "big picture", or both?
- Why you should stay away from the stock market
- Gartner Executive Programs' Worldwide Survey of More Than 2,300 CIOs
- 4.4 Million New IT Jobs Globally to Support Big Data By 2015
- One Third of Organizations Plan to Use Cloud Offerings to Augment BI Capabilities
- Twenty Questions about Big Data and Data Sciences
- Interview with Drew Rockwell, CEO of Lavastorm
- Can we use data science to measure distances to stars?
- Eighteen questions about real time analytics
- Can any data structure be represented by one-dimensional arrays?
- Data visualization: example of a great, interactive chart
- Data science jobs not requiring human interactions
- Featured Data Scientist: Vincent Granville, Analytic Entrepreneur
- Healthcare fraud detection still uses cave-man data mining techniques
- Why are spam detection algorithms so terrible?
- What is a Data Scientist?
- Twenty seven types of data scientists: where do you fit?
- Seven tricky sentences for NLP and text mining algorithms
- How maths should be taught in high school
- An alternative to FICO scores?
- Shopper Alert: Price May Drop for You Alone | NewYorkTimes
- Vertical vs. Horizontal Data Scientists
- 14 questions about data visualization tools
- New, fast Excel to process billions of rows via the cloud
- When data flows faster than it can be processed
- Car accident statistics by profession
- Extreme Data Science
- Six keywords characterizing milestones in the history of analytic engineering: from 1988 to 2033
- A new idea for an analytic business startup
- Four innovative ideas to optimize business processes
- Vincent's answers to data science questions - Part 2
- How do data scientists rank?
- History, Evolution and Classification of Programming Languages
- Automated news feed optimization
- Shopper Alert: Price May Drop for You Alone
- Why are clinical trials failing?
- Is Algebra Necessary? | New York Times
- The 8 worst predictive modeling techniques
- What are the most difficult things to predict?
- Fake Data Science
- Analytics{Benzene} => {big Pharma, Nanotechnologies}
- What MapReduce can't do
- Big Data startup to fix healthcare
- How to reverse-engineer Google?
Part III - Data Science Resources
- Vincent’s list
- History of 24 analytic companies over the last 30 years
- Fifteen great data science articles from influential news outlets
- List of publicly traded analytic companies
- Thirty unusual applications of data sciences, analytics and big data
- 50 unusual ways analytics are used to make our lives better
- Berkeley course on Data Science
- Over 5,000,000 financial, economic and social datasets
- Map of Data Science University Programs
- 27 criteria to choose analytic tools
- Resources
- 50 big data events scheduled worldwide in May 2012
- Top analytic blogs and websites, with trending information
- 86 Helpful Tools for the Data Professional
- List of Free Statistical Software Packages
- List of programming languages
- Sample proposal for a data science / big data project
- Password and hijacked email dataset for you to test your data science skills
- Data Science Tools
- Data Science Dictionary
- Salary/Income of Analytics/Data Mining/Data Science professionals
- 66 job interview questions for data scientists
- Proposal for an Apprenticeship in Data Science