Table Of ContentTable of Contents
1. Introduction
1. EMC Academic Alliance
2. EMC Proven Professional Certification
2. Chapter 1: Introduction to Big Data Analytics
1. 1.1 Big Data Overview
2. 1.2 State of the Practice in Analytics
3. 1.3 Key Roles for the New Big Data Ecosystem
4. 1.4 Examples of Big Data Analytics
5. Summary
6. Exercises
7. Bibliography
3. Chapter 2: Data Analytics Lifecycle
1. 2.1 Data Analytics Lifecycle Overview
2. 2.2 Phase 1: Discovery
3. 2.3 Phase 2: Data Preparation
4. 2.4 Phase 3: Model Planning
5. 2.5 Phase 4: Model Building
6. 2.6 Phase 5: Communicate Results
7. 2.7 Phase 6: Operationalize
8. 2.8 Case Study: Global Innovation Network and Analysis (GINA)
9. Summary
10. Exercises
11. Bibliography
4. Chapter 3: Review of Basic Data Analytic Methods Using R
1. 3.1 Introduction to R
2. 3.2 Exploratory Data Analysis
3. 3.3 Statistical Methods for Evaluation
4. Summary
5. Exercises
6. Bibliography
5. Chapter 4: Advanced Analytical Theory and Methods: Clustering
1. 4.1 Overview of Clustering
2. 4.2 K-means
3. 4.3 Additional Algorithms
4. Summary
5. Exercises
6. Bibliography
6. Chapter 5: Advanced Analytical Theory and Methods: Association Rules
1. 5.1 Overview
2. 5.2 Apriori Algorithm
3. 5.3 Evaluation of Candidate Rules
4. 5.4 Applications of Association Rules
5. 5.5 An Example: Transactions in a Grocery Store
6. 5.6 Validation and Testing
7. 5.7 Diagnostics
8. Summary
9. Exercises
10. Bibliography
7. Chapter 6: Advanced Analytical Theory and Methods: Regression
1. 6.1 Linear Regression
2. 6.2 Logistic Regression
3. 6.3 Reasons to Choose and Cautions
4. 6.4 Additional Regression Models
5. Summary
6. Exercises
8. Chapter 7: Advanced Analytical Theory and Methods: Classification
1. 7.1 Decision Trees
2. 7.2 Naïve Bayes
3. 7.3 Diagnostics of Classifiers
4. 7.4 Additional Classification Methods
5. Summary
6. Exercises
7. Bibliography
9. Chapter 8: Advanced Analytical Theory and Methods: Time Series Analysis
1. 8.1 Overview of Time Series Analysis
2. 8.2 ARIMA Model
3. 8.3 Additional Methods
4. Summary
5. Exercises
10. Chapter 9: Advanced Analytical Theory and Methods: Text Analysis
1. 9.1 Text Analysis Steps
2. 9.2 A Text Analysis Example
3. 9.3 Collecting Raw Text
4. 9.4 Representing Text
5. 9.5 Term Frequency—Inverse Document Frequency (TFIDF)
6. 9.6 Categorizing Documents by Topics
7. 9.7 Determining Sentiments
8. 9.8 Gaining Insights
9. Summary
10. Exercises
11. Bibliography
11. Chapter 10: Advanced Analytics—Technology and Tools: MapReduce and Hadoop
1. 10.1 Analytics for Unstructured Data
2. 10.2 The Hadoop Ecosystem
3. 10.3 NoSQL
4. Summary
5. Exercises
6. Bibliography
12. Chapter 11: Advanced Analytics—Technology and Tools: In-Database Analytics
1. 11.1 SQL Essentials
2. 11.2 In-Database Text Analysis
3. 11.3 Advanced SQL
4. Summary
5. Exercises
6. Bibliography
13. Chapter 12: The Endgame, or Putting It All Together
1. 12.1 Communicating and Operationalizing an Analytics Project
2. 12.2 Creating the Final Deliverables
3. 12.3 Data Visualization Basics
4. Summary
5. Exercises
6. References and Further Reading
7. Bibliography
14. End User License Agreement
List of Illustrations
1. Figure 1.1
2. Figure 1.2
3. Figure 1.3
4. Figure 1.4
5. Figure 1.5
6. Figure 1.6
7. Figure 1.7
8. Figure 1.8
9. Figure 1.9
10. Figure 1.10
11. Figure 1.11
12. Figure 1.12
13. Figure 1.13
14. Figure 1.14
15. Figure 2.1
16. Figure 2.2
17. Figure 2.3
18. Figure 2.4
19. Figure 2.5
20. Figure 2.6
21. Figure 2.7
22. Figure 2.8
23. Figure 2.9
24. Figure 2.10
25. Figure 2.11
26. Figure 3.1
27. Figure 3.2
28. Figure 3.3
29. Figure 3.4
30. Figure 3.5
31. Figure 3.6
32. Figure 3.7
33. Figure 3.8
34. Figure 3.9
35. Figure 3.10
36. Figure 3.11
37. Figure 3.12
38. Figure 3.13
39. Figure 3.14
40. Figure 3.15
41. Figure 3.16
42. Figure 3.17
43. Figure 3.18
44. Figure 3.19
45. Figure 3.20
46. Figure 3.21
47. Figure 3.22
48. Figure 3.23
49. Figure 3.24
50. Figure 3.25
51. Figure 3.26
52. Figure 3.27
53. Figure 4.1
54. Figure 4.2
55. Figure 4.3
56. Figure 4.4
57. Figure 4.5
58. Figure 4.6
59. Figure 4.7
60. Figure 4.8
61. Figure 4.9
62. Figure 4.10
63. Figure 4.11
64. Figure 4.12
65. Figure 4.13
66. Figure 5.1
67. Figure 5.2
68. Figure 5.3
69. Figure 5.4
70. Figure 5.5
71. Figure 5.6
72. Figure 6.1
73. Figure 6.2
74. Figure 6.3
75. Figure 6.4
76. Figure 6.5
77. Figure 6.6
78. Figure 6.7
79. Figure 6.10
80. Figure 6.8
81. Figure 6.9
82. Figure 6.11
83. Figure 6.12
84. Figure 6.13
85. Figure 6.14
86. Figure 6.15
87. Figure 6.16
88. Figure 6.17
89. Figure 7.1
90. Figure 7.2
91. Figure 7.3
92. Figure 7.4
93. Figure 7.5
94. Figure 7.6
95. Figure 7.7
96. Figure 7.8
97. Figure 7.9
98. Figure 7.10
99. Figure 8.1
100. Figure 8.2
101. Figure 8.3
102. Figure 8.4
103. Figure 8.5
104. Figure 8.6
105. Figure 8.7
106. Figure 8.8
107. Figure 8.9
108. Figure 8.10
109. Figure 8.11
110. Figure 8.12
111. Figure 8.13
112. Figure 8.14
113. Figure 8.15
114. Figure 8.16
115. Figure 8.17
116. Figure 8.18
117. Figure 8.19
118. Figure 8.20
119. Figure 8.21
120. Figure 8.22
121. Figure 9.1
122. Figure 9.2
123. Figure 9.3
124. Figure 9.4
125. Figure 9.5
126. Figure 9.6
127. Figure 9.7
128. Figure 9.8
129. Figure 9.9
130. Figure 9.10
131. Figure 9.11
132. Figure 9.12
133. Figure 9.13
134. Figure 9.14
135. Figure 9.15
136. Figure 9.16
137. Figure 10.1
138. Figure 10.2
139. Figure 10.3
140. Figure 10.4
141. Figure 10.5
142. Figure 10.6
143. Figure 10.7
144. Figure 11.1
145. Figure 11.2
146. Figure 11.3
147. Figure 11.4
148. Figure 12.1
149. Figure 12.2
150. Figure 12.3
151. Figure 12.4
152. Figure 12.5
153. Figure 12.6
154. Figure 12.7
155. Figure 12.8
156. Figure 12.9
157. Figure 12.10
158. Figure 12.11
159. Figure 12.12
160. Figure 12.13
161. Figure 12.14
162. Figure 12.15
163. Figure 12.16
164. Figure 12.17
165. Figure 12.18
166. Figure 12.19
167. Figure 12.20
168. Figure 12.21
169. Figure 12.22
170. Figure 12.23
171. Figure 12.24
172. Figure 12.25
173. Figure 12.26
174. Figure 12.27
175. Figure 12.28
176. Figure 12.29
177. Figure 12.30
178. Figure 12.31
179. Figure 12.32
180. Figure 12.33
181. Figure 12.34
182. Figure 12.35
Description:efforts, Big Data analytics enable new avenues of investigation with .. However, their proliferation can result in “many versions of the truth. Do the parameter values of the fitted model make sense in the context of the png(), bmp(), pdf(), and postscript(), are available in R to save plots in