BigData
2016-05-10 13:08:59
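AI-generated summary
BigData, or big data, refers to massive data sets generated at unprecedented speed and scale. These data sets typically include both structured and unstructured data. The characteristics of big data are commonly summarized as the "3 Vs": volume, velocity, and variety. Big data technology aims to uncover valuable information in these huge data sets to support decision-making, business optimization, and innovation. To process big data, people use a range of technologies and tools, such as distributed computing frameworks (e.g. Hadoop and Spark), database technologies (e.g. NoSQL databases), and data analysis and visualization tools. Big data analytics can help companies better understand customer needs, optimize supply chain management, and improve production efficiency. In short, big data has become an important part of today's digital era and has had a profound impact across industries.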
Outline / Content
What elements make up the reference architecture?
Reference architecture
Data Source
Mobility
In situ
Streaming
Structure
Structured
Unstructured
Data process
Data extraction
Extraction
Stream extraction
Data loading and pre-processing
Transfer, load
Data compression
Data processing
Stream processing
Information extraction
Combining
Replication
Cleaning
Data analysis
Deep analytics
Stream analysis
Data loading and transformation
Transformation
Transfer, load
Data storage
Different data sources and flows call for different storage methods
Interfacing and visualization
Visualization
Dashboarding
End user
Job and model specification
Models
Model specification
Machine learning
Jobs
Job specification
Job scheduling
Use cases
Facebook
Infrastructure and mapping
Data source
MySQL -- In situ and structured
Web service -- Streaming and semi-structured
Data processing
Data analysis
Hive jobs -- Deep analytics
Data loading and transformation
Cube generation -- Transformation
Interfacing and visualization
MicroStrategy UI -- app
HiPal -- user app
Other cases
LinkedIn
Twitter
Netflix
BlockMon
Network measurement
FIU-Miner
The goal of the examples
All of the published infrastructures can be mapped onto the reference architecture
Answer
It comprises semi-detailed functional components, data stores, and the data flows between them
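The mapping idea above can be sketched in code. This is a minimal illustration, not part of the original outline: the `Stage` enum, the `FACEBOOK_MAPPING` table, and both helper functions are hypothetical names. The stage list assumes each functional area named under the reference architecture is a peer stage, since the outline's nesting is not preserved here. The Facebook entries are taken from the use case listed above.

```python
from enum import Enum


class Stage(Enum):
    """Functional areas of the reference architecture (assumed to be peers)."""
    DATA_SOURCE = "Data source"
    DATA_EXTRACTION = "Data extraction"
    DATA_LOADING_PREPROCESSING = "Data loading and pre-processing"
    DATA_PROCESSING = "Data processing"
    DATA_ANALYSIS = "Data analysis"
    DATA_LOADING_TRANSFORMATION = "Data loading and transformation"
    DATA_STORAGE = "Data storage"
    INTERFACING_VISUALIZATION = "Interfacing and visualization"
    JOB_MODEL_SPECIFICATION = "Job and model specification"


# Component -> (stage, role), copied from the Facebook use case in the outline.
FACEBOOK_MAPPING = {
    "MySQL": (Stage.DATA_SOURCE, "in situ, structured"),
    "Web service": (Stage.DATA_SOURCE, "streaming, semi-structured"),
    "Hive jobs": (Stage.DATA_ANALYSIS, "deep analytics"),
    "Cube generation": (Stage.DATA_LOADING_TRANSFORMATION, "transformation"),
    "MicroStrategy UI": (Stage.INTERFACING_VISUALIZATION, "app"),
    "HiPal": (Stage.INTERFACING_VISUALIZATION, "user app"),
}


def components_by_stage(mapping):
    """Group component names by the stage they are mapped to."""
    grouped = {}
    for component, (stage, _role) in mapping.items():
        grouped.setdefault(stage, []).append(component)
    return grouped


def unmapped_stages(mapping):
    """Return the stages that no component in the mapping covers."""
    used = {stage for stage, _role in mapping.values()}
    return [stage for stage in Stage if stage not in used]


if __name__ == "__main__":
    for stage, components in components_by_stage(FACEBOOK_MAPPING).items():
        print(f"{stage.value}: {', '.join(components)}")
    print("Not covered:", [s.value for s in unmapped_stages(FACEBOOK_MAPPING)])
```

An unmapped stage only means this outline lists no component for it; it does not mean the real system lacks one.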
How to classify?
Data storage or collection
Database
NoSQL
Relational
In-memory
Log files
Other unique systems
Data processing
Batch processing
Stream processing
Virtualization
Commercial products
Services
DaaS
SaaS
IaaS
Log service
Product
Analytics infrastructure
Operational infrastructure
Visualization/BI tools
Technology frameworks
Cloud solutions
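The classification above can be represented as a nested tree and queried. The sketch below is only an illustration: the outline's indentation is not preserved, so the parent/child nesting in `TAXONOMY` is an assumption, including the placement of "Services" and "Product" under "Commercial products". `TAXONOMY` and `find_path` are hypothetical names.

```python
# Nested dict: key = category label, value = sub-categories ({} for a leaf).
# The nesting is an assumed reading of the flattened outline.
TAXONOMY = {
    "Data storage or collection": {
        "Database": {"NoSQL": {}, "Relational": {}, "In-memory": {}},
        "Log files": {},
        "Other unique systems": {},
    },
    "Data processing": {
        "Batch processing": {},
        "Stream processing": {},
    },
    "Virtualization": {},
    "Commercial products": {
        "Services": {"DaaS": {}, "SaaS": {}, "IaaS": {}, "Log service": {}},
        "Product": {
            "Analytics infrastructure": {},
            "Operational infrastructure": {},
            "Visualization/BI tools": {},
            "Technology frameworks": {},
            "Cloud solutions": {},
        },
    },
}


def find_path(tree, label, prefix=()):
    """Depth-first search; return the path from the root to `label`, or None."""
    for name, children in tree.items():
        path = prefix + (name,)
        if name == label:
            return list(path)
        found = find_path(children, label, path)
        if found is not None:
            return found
    return None


if __name__ == "__main__":
    print(" > ".join(find_path(TAXONOMY, "NoSQL")))
    print(" > ".join(find_path(TAXONOMY, "Stream processing")))
```

A nested dict keeps the sketch short. A real catalogue would likely attach concrete technologies or products to each leaf.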