Project

Big data processing projects transform streams of unstructured or semi-structured data, such as data from social media sites or system-generated log files, into structured files from which a database or collection can be built and queried. Such structured outputs could be tables in an RDBMS, key-value stores, JSON files, CSV (comma-separated values) files for MongoDB, Cassandra, or VoltDB, or a table in HBase on HDFS.
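As a minimal sketch of the kind of transformation described above, the following Python example flattens semi-structured, newline-delimited JSON records into CSV rows. The sample records, the field names, and the helper names (`flatten`, `to_csv`) are assumptions made for illustration; they do not come from any particular social media API.

```python
import csv
import io
import json

# Hypothetical semi-structured input: one JSON object per line, with
# optional and nested fields (field names are illustrative only).
RAW_POSTS = """\
{"id": 1, "user": {"name": "alice"}, "text": "hello world", "tags": ["a", "b"]}
{"id": 2, "user": {"name": "bob"}, "text": "big data"}
"""

# The flat column set of the structured output.
FIELDS = ["id", "user_name", "text", "tags"]


def flatten(record):
    """Map one nested record onto the flat column set in FIELDS."""
    return {
        "id": record["id"],
        "user_name": record.get("user", {}).get("name", ""),
        "text": record.get("text", ""),
        # Join the list so it fits in a single CSV cell.
        "tags": ";".join(record.get("tags", [])),
    }


def to_csv(raw_lines):
    """Convert newline-delimited JSON text into a CSV string."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for line in raw_lines.splitlines():
        if line.strip():
            writer.writerow(flatten(json.loads(line)))
    return out.getvalue()


csv_text = to_csv(RAW_POSTS)
print(csv_text)
```

The resulting CSV text could then be written to a file and loaded into whichever target platform a project selects.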

1. LinkedIn data transformation into one of the following platforms:

1-1) Tables in RDBMS (MS SQL Server using LINQ or any database server with Java/JDBC) or VoltDB

1-2) CSV (comma-separated values) files or any structured files (key-value store, JSON) in Hadoop, to query with any applicable tools such as Flume, Hive, Pig Latin, HBase, MongoDB and more.

Related papers to read:

Avatara: OLAP for Web-scale Analytics Products. Lili Wu, Roshan Sumbaly, Chris Riccomini, Gordon Koo, Hyung Jin Kim, Jay Kreps, Sam Shah (LinkedIn).

The “Big Data” Ecosystem at LinkedIn. Roshan Sumbaly, Jay Kreps, and Sam Shah (LinkedIn).

2. Facebook Timeline message data transformation into one of the following platforms:

2-1) Tables in RDBMS (MS SQL Server using LINQ or any database server with Java/JDBC) or VoltDB.

2-2) Key-value stores, JSON, CSV (comma-separated values) files or any structured files in Hadoop, to query with any applicable tools such as Flume, Hive, Pig Latin, HBase, MongoDB, Cassandra and more.

Related papers to read (will be given):

Petabyte Scale Databases and Storage Systems Deployed at Facebook. Dhruba Borthakur.

CIS 612, Sunnie S. Chung, Cleveland State University

3. Facebook Friends Social Network (Graph API) data transformation into one of the following platforms:

3-1) Tables in RDBMS (MS SQL Server using LINQ or any database server with Java/JDBC), or VoltDB

3-2) Key-value stores, JSON, CSV (comma-separated values) files or any structured files in Hadoop, to query with any applicable tools such as Flume, Hive, Pig Latin, HBase, MongoDB, Cassandra and more.

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo (Facebook) et al., SIGMOD 2010.

http://hive.apache.org/

Muppet: MapReduce-Style Processing of Fast Data. Wang Lam, Lu Liu, STS Prasad, Anand Rajaraman, Zoheb Vacheri, AnHai Doan (WalmartLabs; University of Wisconsin-Madison).

4. Twitter message data transformation into one of the following platforms:

4-1) Tables in RDBMS (MS SQL Server using LINQ or any database server with Java/JDBC), or VoltDB.

4-2) Key-value stores, JSON, CSV (comma-separated values) files or any structured files in Hadoop, to query with any applicable tools such as Flume, Hive, Pig Latin, HBase, MongoDB, Cassandra and more.

Related papers to read (will be given):

The Unified Logging Infrastructure for Data Analytics at Twitter. George Lee, Jimmy Lin, Chuang Liu, Andrew Lorek, and Dmitriy Ryaboy (Twitter, Inc.).

Fast Data in the Era of Big Data: Twitter’s Real-Time Related Query Suggestion Architecture. Gilad Mishne, Jeff Dalton, Zhenghua Li, Aneesh Sharma, Jimmy Lin (Twitter, Inc.).
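For the key-value option in 4-2, one possible layout is sketched below in Python: each message is stored under a `message:<id>` key, and a secondary `hashtag:<tag>` key lists the ids of messages that mention the tag. The sample messages, the key naming scheme, and the plain `dict` standing in for a real key-value store are all assumptions made for illustration.

```python
import json
import re
from collections import defaultdict

# Hypothetical messages; a real project would read them from an API
# or an export file.
MESSAGES = [
    {"id": 101, "user": "alice", "text": "Loading logs into #Hive today"},
    {"id": 102, "user": "bob", "text": "#hive and #HBase on the same cluster"},
    {"id": 103, "user": "carol", "text": "No tags here"},
]

HASHTAG = re.compile(r"#(\w+)")


def build_key_value_pairs(messages):
    """Return a dict that stands in for a key-value store.

    Keys of the form "message:<id>" hold the message as a JSON string;
    keys of the form "hashtag:<tag>" hold a JSON list of message ids.
    """
    store = {}
    tag_index = defaultdict(list)
    for msg in messages:
        store[f"message:{msg['id']}"] = json.dumps(msg, sort_keys=True)
        for tag in HASHTAG.findall(msg["text"]):
            # Lower-case tags so "#Hive" and "#hive" share one key.
            tag_index[tag.lower()].append(msg["id"])
    for tag, ids in tag_index.items():
        store[f"hashtag:{tag}"] = json.dumps(ids)
    return store


store = build_key_value_pairs(MESSAGES)
print(store["hashtag:hive"])
```

Serializing every value as a JSON string keeps the pairs portable, since most key-value stores accept plain strings as values.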

5. Transform log files from any system into one of the following platforms:

5-1) Tables in RDBMS (MS SQL Server using LINQ or any database server with Java/JDBC), or VoltDB.

5-2) Key-value stores, JSON, CSV (comma-separated values) files or any structured files in Hadoop, to query with any applicable tools such as Flume, Hive, Pig Latin, HBase, MongoDB, Cassandra and more.
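A log-file transformation of this kind might start with a parser like the Python sketch below, which turns raw lines into structured records and emits them as JSON lines. The line layout (`<date> <time> <LEVEL> <component>: <message>`), the sample lines, and the function name `parse_log` are assumptions; real system logs vary, and the pattern would need to be adapted to the chosen source.

```python
import json
import re

# Assumed (hypothetical) line layout:
#   "<date> <time> <LEVEL> <component>: <message>"
LOG_PATTERN = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<level>[A-Z]+) (?P<component>[\w.-]+): (?P<message>.*)$"
)

SAMPLE_LOG = """\
2024-01-15 10:32:01 INFO web: request served in 12 ms
2024-01-15 10:32:05 ERROR db: connection timed out
this line does not match the layout
"""


def parse_log(text):
    """Return (records, rejected): parsed dicts and lines that did not match."""
    records, rejected = [], []
    for line in text.splitlines():
        match = LOG_PATTERN.match(line)
        if match:
            records.append(match.groupdict())
        elif line.strip():
            # Keep unparsed lines so they can be inspected later.
            rejected.append(line)
    return records, rejected


records, rejected = parse_log(SAMPLE_LOG)
# One JSON object per line, a format many loaders accept.
json_lines = "\n".join(json.dumps(r, sort_keys=True) for r in records)
print(json_lines)
```

Collecting the rejected lines separately, rather than dropping them, makes it easier to see how much of a log the pattern fails to cover.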

6. Transform any online electronic book into one of the following platforms:

6-1) Tables in RDBMS (MS SQL Server using LINQ or any database server with Java/JDBC), or VoltDB.

6-2) Key-value stores, JSON, CSV (comma-separated values) files or any structured files in Hadoop, to query with any applicable tools such as Flume, Hive, Pig Latin, HBase, MongoDB, Cassandra and more.
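For the relational option in 6-1, the Python sketch below loads a short text into two tables and runs a query against them. It uses SQLite through Python's built-in `sqlite3` module purely as a stand-in, not one of the servers named above; the table names, column names, and helper functions are assumptions made for illustration.

```python
import re
import sqlite3
from collections import Counter

# A short public-domain excerpt standing in for a full e-book.
SAMPLE_TEXT = """\
It was the best of times, it was the worst of times,
it was the age of wisdom, it was the age of foolishness.
"""


def load_lines(conn, text):
    """Store the text line by line so the original order can be queried."""
    conn.execute(
        "CREATE TABLE book_lines (line_no INTEGER PRIMARY KEY, content TEXT NOT NULL)"
    )
    rows = [(i, line) for i, line in enumerate(text.splitlines(), start=1)]
    conn.executemany("INSERT INTO book_lines VALUES (?, ?)", rows)


def load_word_counts(conn, text):
    """Store one row per distinct lower-cased word with its frequency."""
    words = re.findall(r"[a-z']+", text.lower())
    conn.execute(
        "CREATE TABLE word_counts (word TEXT PRIMARY KEY, n INTEGER NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO word_counts VALUES (?, ?)", list(Counter(words).items())
    )


conn = sqlite3.connect(":memory:")  # in-memory database, nothing is written to disk
load_lines(conn, SAMPLE_TEXT)
load_word_counts(conn, SAMPLE_TEXT)

# Most frequent words, ties broken alphabetically.
top_words = conn.execute(
    "SELECT word, n FROM word_counts ORDER BY n DESC, word LIMIT 3"
).fetchall()
line_count = conn.execute("SELECT COUNT(*) FROM book_lines").fetchone()[0]
print(top_words, line_count)
```

The same two-table design should carry over to other SQL databases with only minor changes to the `CREATE TABLE` statements.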
