Barge Database

来源 :KNOWLEDGE IS POWER | 被引量 : 0次 | 上传用户:ywdiy_cn
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  The last week of September, 2014 saw the official listing of Alibaba( A China’s giant company) in the New York Stock Exchange(NYSE:BABA), which is the first Initial public offerings( IPO) and the largest scale in history, also marks the Interent evolves into a new era---a big data era that belongs to Chinese domestic internet enterprises.
  The past and present big data
  Big data or mass data refers to the data size is so large that it can not be extracted, managed, handled and processed as the information that can be interpreted by human beings with a proper range of time. Under the same condition, compared with those independent small-scale dataset which could analyzed data individually, more additional information and relational data base will be obtained if the analysis is based on the grouping of each small data. Such approach can be applied to forecast the commercial trend, judge the quality of research, avoid the widespread of disease, fight against crimes or predict real-time traffic and others.
  Though far away from our daily life, big data has close ties with our daily life in deed. For example, Douban Music( a name of a Chinese social network) can infer which song is most liked by a certain user after its analysis of behaviour of user population, even user’s favorite movie can also be inffered. Through confluence analysis of sales data of its retail stores, Adidas can exactly know the consumers’ preference over their products in different regional culture so as to make a more resonable strategy of inventory stocking up in a smarter way. A love and marrige website in China is trying to introduce a system that can identify facial resemblance, the company is able to conclude which facial form is most enjoyed by its users on the basis of used information, then they can provide such popular service among its users. Taobao(the biggest C2C shopping website in China’s mainland) can predict the possible goods that each consumer is interested in, thereout, individualized recommendation targeted to each user can be produced, this is what most people often see in the side bar of it commodity recommendation. Through the analysis of the information of classified commodities by large database model, Taobao is able to answer some interesting questions which are hard to most people, such as what is the favourite color of the T-shirt for the age group of 18 , or what is the difference between the people living in South and North China when it comes to preference of sports beverage?   The simple analysis of user behaviour will not produce too much value, while if the analysis is based on a quite large scale, then we can obtain valuable prediction from its performing trend, the decision-making in business in particular. In the past, take the well-known NongFu Spring (A Chinese enterprise of drinking water production) for example, if the company wants to get such market data to help them to make decisions as how to pile up can promote its sales? The people of which age group can spend most time in front of the pile? What is their purchasing volume each time? What changes of purchasing behaviour might take place for the change of temperature? How its competitor’s new packing influence its own sales? Though seem easy, these questions are hard to get convincing answers.
  To answer the above questions, a lot of data needs to be collected. The salesmen from NongFu Spring have to come to local supermarkets to take ten pictures every day: the piling of the bottles, the change of their location, the height of the bottle piling and so on. Every day they have to cover 15 places for investigation and survey, and upload 150 pictures, producing data size about 10M which is not a large figure. While there are 10,000 salesmen across China, that means the data size is 100G, 3TB each month. Though these data seem easy, but without the support of relevant technology concerning about big data, such analysis could not be obtained.
  There is one in Google had pointed out:” what really matters is not what we can do, but what is the right size can we do.”
  It only needs several pieces of paper and a pen if you can just analyze 100 lines of data every day. But if you want to analyze 100,000 lines of data, according to the processing capacity of modern computer, you just need a computer and design programme. But if the data size has reached 1000000000 lines(1TB), even a powerful server station will satisfy your need, especially when you want a real-time or close to real-time processing speed. Thus, the field of computer and numerical calculation witnesses the occurrence of a trend—distributed computing which is a science requires a system by the connecting of cluster of computers through network and then engineering data that needs massive calculation will be divided into small computing areas, then the data will be processed by each computer of the network, after uploading the calculating results which will be combined to arrive at a final data conclusion. But in order to make full use of distributed computing, we have to solve such problems as how to divide the data? How can we achieve a balanced processing of the operating load of each computer? How to combine each result into a final data efficiently? Many computing models and concepts have been designed for the purpose of solving these problems from the hardware and software of computers. Some of the most representative are cloud computing, MapReduce (Handoop) , virtualization and others. While this might only be the beginning of the computing tide. Just like Jack Ma had said:” we are moving from an era of information science and technology to an era of data science and technology.”   Mass data and
  the new occupations
  of the Internet
  To do well in mass data, the first thing of vital importance is to get massive valuable data, which is an advantage that most native Chinese Internet enterprises have. China has a large population, dynamic economy, millions of internet users, the abundance of user’s behavor data is directly decided by the abundance of user data resources. Taobao has 300 million registered users and Tencent’s registered users has already exceeded 1 billion. All the user data is absolutely a goldmine.
  A new generation technology is bound to bring up full demand of technicians of a new generation. In an era of big data, data scientist and data engineer have been one of the hottest occupation in Silicon Valley. Comparing to the traditional software engineer, data scientist is a group of researchers who stand between mathematics(statitics) and computer science, their job includes both software design and development and data modelling and statistic analysis, meantime, they are able to turn data processing model into feasible software solutions. So the native Chinese internet enterprises also attach great importance to the reservation of talents in the field of data science, in the foreseeable future, practitioners of data science must be very popular in the job market.
其他文献
任何摄影题材,都存在一些基本技术,无论你拍摄对像或主体、使用器材或地点,都应该作出考虑。即使是专业摄影师也可能会忽略一些基本规则。来看看摄影师Ed Gregory的分享,当中他提出了不少人像拍摄的注意事项,值得大家参考。  1、位置  当你选择拍摄人像的位置时,请确保模特儿的位置不在阳光直射底下,最好选择在阴凉或阴暗的环境当中。此外,当拍摄时亦应避免拍摄到复杂的背景,而应选择一个简约而不会令观众分
期刊
对于摄影师而言,怎样存储、管理自己手上成千上万而且越来越多的照片,保证它们不会损毁,而且在需要它们时能够迅速检索到无疑是一件值得重视的事情。最近我们有幸借来了NAS产品界知名厂商威联通QNAP)的最新产品TS-451A,通过对它的试用来详细介绍NAS会给摄影人带来哪些好处。  多磁盘阵列更安全的存储方式  NAS(Network Attached Storage),是一种基于网络 的数据存储中心。
期刊
美国乔·斯坦菲尔德|Joel Sternfeld (1944年-今) 以大幅攝影和充满表现力的色彩从70年代起成为彩色摄影的重要影响人物之一。  乔的脑袋里应该安装了一个关于精密色彩的度量衡,他通过对色彩的严谨控制利用来更全面地表现作品情绪和内涵这件事在70年代是股新颖的风潮。  乔的独到之处还在于他既回流大幅摄影又同时采用先锋代表"色彩"来进行纪实创作,远距离观望,冷静地见证事件,故事性地记叙,
期刊
拍照和其他技能一样,都是要靠不断的练习来提高自己的水平的,但是经常我们会遇到的一大问题就是,拍摄一段时间以后,会觉得什么都拍过了,再拍也只是拍摄从前拍过的东西,千篇一律的拍摄并不能带来新鲜感,只会带来疲累。你到了拍摄的瓶颈期了。  这时候你需要走出你最擅长的舒适区,引导自己来追求自己的创造力。  当你觉得自己没有拍摄灵感的时候,不妨可以试试下面这些小贴士。  1、为你的作品添加情感  照片中的情绪
期刊
Today, traffic jam is a common problem faced with many cities, metropolitan in particular, even the third-tier cities begin to suffer from traffic congestion. When we are stopped on the road, we will
期刊
在当模特被摄影师拍了接近一年之后,看着别人镜头下的自己,我突然萌生了买相机的念头。买什么相机好呢?如果买个好一点的数码机,预算又不太夠。之前一直给我拍照、我的胶片启蒙者尧神说,买个胶片机吧,一千多就可以买个差不多的,你看我给你拍了好多胶片,多好看。正好刘蛋白跟我说:“我这里有一个胶片机,你先拿去玩呗。”  她的胶片机是佳能AE-1。之前就在电影《只想爱着你》里看到过它的身影,宫崎葵和玉木宏拿着它在
期刊
“所有的水塔有一个共性,不管最初的建造目的如何,最终它们都会成为乡间的一座地标,某种意义上,它们是象征工业化的纪念碑。”  Gediminas Karbauskis,立陶宛摄影师,出生于1979年,这组作品名为《Water Towers of Luxembourg》,记录了卢森堡乡间造型各异的水塔。  “水塔一般用于储存水源,在小镇和乡间比较常见,卢森堡也不例外。一座乡间的水塔可以在周围几公里范围
期刊
即使是手机,使用镜面反射也可以拍出惊人的照片,那在哪里可以找到反射呢?以下两点可以给你提供一点灵感。  一、寻找有反射的地方  (1)光滑表面。不要将自己的目光限制在闪亮的地板上。墙壁(大理石)、桌面、门把手和汽车只是一个开始。当你外出拍摄时,请注意任何会反光的表面。  善用镜面技巧手机也能拍大片  (2)商场橱窗。商店橱窗中的反射可以使作品更加扑朔迷离――内部与外部的元素叠加以及光线的重组。  
期刊
Peter Adams 曾经说过:好的照片在于情感的深度,而非景深。  说是这么说,但是景深却是实实在在存在的。既然有景深,就会有焦外,就會存在模糊。当然一张照片模糊并不仅仅是焦外,还有可能是手抖等因素。想要拍出好照片,就一定要清楚。  别拿卡帕和森山大道的照片来举反例,卡帕只有一个,森山大道只有一个,而我们,则是千千万万分之一。  想要拍出一张主体清楚的照片,就要避免下面这些错误。  1、焦点对
期刊
自拍好像已经成为了我们日常生活中重要的一环,尤其是姑娘们,没事儿就喜欢拿着手机对着镜头嘟个嘴。无论在什么地方,在家里,在餐厅,在海滩上,还是在爱人的怀里,随着智能手机的发展以及社交網络的蔓延,自拍已经成了一种热度,烧坏了所有。  但是在胶片时代,臭美的他们又是怎么自拍的呢?
期刊