Word Embedding Bootstrapped Deep Active Learning Method to Information Extraction on Chinese Electro

来源 :上海交通大学学报(英文版) | 被引量 : 0次 | 上传用户:jerryzhang1805
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Electronic medical record (EMR) containing rich biomedical information has a great potential in disease diagnosis and biomedical research.However,the EMR information is usually in the form of unstructured text,which increases the use cost and hinders its applications.In this work,an effective named entity recognition(NER) method is presented for information extraction on Chinese EMR,which is achieved by word embedding bootstrapped deep active learning to promote the acquisition of medical information from Chinese EMR and to release its value.In this work,deep active learning of bi-directional long short-term memory followed by conditional random field (Bi-LSTM+CRF) is used to capture the characteristics of different information from labeled corpus,and the word embedding models of contiguous bag of words and skip-gram are combined in the above model to respectively capture the text feature of Chinese EMR from unlabeled corpus.To evaluate the performance of above method,the tasks of NER on Chinese EMR with “medical history” content were used.Experimental results show that the word embedding bootstrapped deep active learning method using unlabeled medical corpus can achieve a better performance compared with other models.
其他文献
Bearing incipient fault characteristics are always submerged in strong background noise with weak fault characteristics,so that the incipient fault is hard to detect.Stochastic resonance (SR) is accepted to be an effective way to detect the incipient;howe
Some fast finite impulse response (FIR) filters use a large number of look-up tables (LUTs) to configure distributed random-access memories (RAMs) and save registers.The distributed RAMs store 2M precomputed sums of M permuted operands in order to simplif
Karma alloy thin film strain gauges were fabricated on alumina substrates by magnetron sputtering.The electrical properties of strain gauges annealed at different temperatures were then tested.The surface mor-phology and phase structure of the Karma alloy
As an important indicator for the appearance and intrinsic quality of textiles,fabric flatness is the immediate cause affecting the aesthetic appearance and performance of textiles.In this paper,the objective evaluation system of fabric flatness based on
Magnetic resonance-guided focused ultrasound surgery (MRgFUS) is an emerging,non-invasive hy-perthermia technology which can be used for the treatment of benign and malignant tumours,in conjunction with intracranial neurological diseases.To treat differen
为获得生物基PA56短纤染色前最佳去油方法,采用水洗法、洗涤剂法、乙醇法、乙醚萃取法测定生物基PA56短纤的含油率.采用SEM观察去油前后纤维表面形貌,采用EDS分析去油前后纤维表面元素.结果表明,水洗法得到的纤维含油率为1.195%,最接近实际上油率;纤维表面光滑,无损伤,无油剂及杂质残留.水洗法操作简单,无需化学溶剂,绿色环保.
采用黄柏、槐米与茜草中提取的植物染料对黏胶织物进行功能整理.测试表明,黄柏、槐米与茜草中提取的植物染料对远紫外线与近紫外线具有较好的吸收性能.染色后织物的抗紫外线性能均有不同程度提高,其折皱回复角、回潮率与断裂强力不同程度下降,变化幅度与植物染料的上染率呈正相关关系.经摩擦后,黏胶织物与染色皂洗织物相比,折皱回复角、回潮率略高,断裂强力略低,对织物的服用性能影响较小.研究认为:黄柏、槐米与茜草染料对黏胶织物进行防紫外线功能整理具有较强的可行性.
根据永定河官厅水文站1956年~2016年61 a实测月径流资料,采用Tennant法、90%保证率最枯月平均流量法、基流比例法估算永定河官厅下游段生态基流。综合对比分析确定,永定河官厅下游段生态基流为2.05~3.93 m3/s;结合生态流量与现状年河道实际流量进行对比,评估分析三种方法计算得出的河道生态基流的保障程度表明,基流比例法在该区域表现最好,而90%保证率最枯月平均流量法计算得出的结果偏小,Tennant法在计算永定河此类型的北方季节性河流有明显的不足。
This work aims to explore the restoration of images corrupted by impulse noise via distribution-transformed network (DTN),which utilizes convolutional neural network to learn pixel-distribution features from noisy images.Compared with the traditional medi
To improve the signal detection performance of binary-sequence frequency hopping communication when the complementary channel is jammed,a binary-sequence frequency hopping communication system based on pseudo-random liner frequency modulation (LFM) is pro