1、Big data,computer specialty,Do you know?,Taobao search,definition,definition,Characteristics:,Volume : data size Velocity :speed of change Variety : different forms of data sources,“Big data is high volume, high velocity, and/or high variety information assets that require new forms of processing to
2、 enable enhanced decision making, insight discovery and process optimization.“ Gartner,Big data is a term applied to data sets whose size is beyond the ability of commonly used software tools to capture ,manage and process the data within a tolerable elapsed time.,(过去的),application,E-commerce,Medica
3、l treatment,transportation,Taobao Amazon,1.3 million transactions in 2015 worldwide;,TransDec (Transportation Decision- Making) Baidu map,application,Bank transactions,EHR(electronic health record);,EHR electronic health record,EHRs offer plenty of datatest results, diagnoses, prescriptions, emergen
4、cy room (ER) visits, previous hospitalizations, demographic information.The software works by sifting through records of patients who were previously hospitalized and learning which risk factora certain number of chest complaints or an unusual level of a particular enzyme in the heart, for examplemi
5、ght have been red flags. The algorithm then uses those red flags to warn of future hospitalizations.,opportunities,opportunities,data revolutiontoday a massive amount of data is regularly being generated and flowing from various sources, through different channels, every minute in todays Digital Age
6、.Now: available digital data:150 EB(2005) 1200 EB(2010)Predicted: the stock of digital data is expected to increase 44 times between 2007 and 2020, doubling every 20 months.,The early years of data revolution:,challenges,challenges,DataAnalysis“what is the data really telling us?”,privacyaccess and
7、sharing,summarizing the data interpreting defining and detecting anomalies,fig. New types of research data about human behavior and society pose many opportunities if crucial infrastructural challenges are tackled.,Words and sentences,5,words,B2C : business-to-consumer 企业对消费者 Cookie : 指的是指网站为了辨别用户身份
8、而储存在用户本地终端浏览器上的一类数据。 TB : terabyte 1TB=1024MB PB : petabyte 1PB=1024TB EB : exabyte 1EB=1024PB ETL : Extract Transform Load,是指数据的提取、转换、加载。 Database : 数据库,sentences,1、The trend is especially impressive in Sub-Saharan Africa, (where mobile phone technology has been used as a substitute (for usually we
9、ak telecommunication and transport infrastructure ) (as well as underdeveloped financial and banking systems).,():定语,修饰Sub-Saharan Africa ():介词 ():并列作用,这个趋势在撒哈拉以南尤其令人印象深刻,这里的移动电话技术已经被用来作为弱电信和交通基础设施以及欠发达的银行和金融系统的替代品。,sentences,2、(Initially developed in such fields as computational biology , biomedica
10、l engineering, medicine, and electronics, )Big Data analytics refers to (tools and methodologies) that( aim to transform massive quantities of raw data into “data about the data”for analytical purposes).,大数据起初在生物学,生物医学工程,医学,电子开发等领域发展,它是为了将庞大数量的原始数据转变为 -用于分析的目的“有关数据的数据”的工具和方法。,conclusion,Part 6,concl
11、usion,Part 6,Data on todays scales require scientific and computational intelligence.Big Data Future is a free, public, multidisciplinary conference on the possibilities for new enterprises grounded in “big data” to improve economic, social, and political life. What is needed is both intent and capacity to be sustainedand strengthened, on the basis of a full recognition of the opportunities and challenges.,Thank you,