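Abstract [Objective] To meet the need for statistics on an institution's SCI-indexed papers, design software that automatically identifies the target institution's authors and laboratories. [Context] The software helps paper-statistics departments identify an institution's paper authors and laboratories (departments) quickly and accurately, and thereby obtain the distribution of paper output across the institution's authors and laboratories. [Methods] Target-institution authors are identified by combining three kinds of evidence: the tendency of authors in the same research unit to co-author frequently, user-defined unique keywords or co-author fields for each author, and the text features of the author-related fields in the SCI database. [Results] By maintaining a personnel list for the target institution, users can achieve automated, high-accuracy author discrimination for SCI papers. [Conclusions] The software effectively addresses the difficulty of accurately counting author-level paper data for Chinese authors of SCI papers, whose names have varied pinyin spellings and are prone to duplication; its design also applies to author discrimination in EI and other databases. Besides author discrimination, the software also cleans the institution's paper data: provided the user has entered a complete set of unique identifying terms for the target institution's name, a paper from which no target-institution author information can be extracted was most likely not published by the target institution.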
Keywords: paper statistics, author discrimination, SCI, software design
Abstract: [Objective] Software that identifies a scientific institute's authors of scientific papers is designed to meet the demands of statistics on papers indexed by SCI. [Context] It helps departments that compile statistics on SCI papers determine the Chinese characters of Chinese author names belonging to their institute, together with each author's corresponding lab. [Methods] Author discrimination is implemented by combining one characteristic of scientific research, that people from the same research unit are more likely to co-author papers, with custom unique keywords or co-authors and the text features of the author fields in SCI. [Results] Automated, high-accuracy author discrimination can be achieved by maintaining a personnel list for the institute. [Conclusions] It effectively solves the problem of duplicate Chinese names in the analysis of SCI papers, and its design ideas also apply to other databases such as EI and Inspec.
Key words: Paper statistics, Author discrimination, SCI, Software design
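The [Methods] summary names three kinds of evidence: name text in the SCI author fields, user-defined unique keywords or co-authors, and co-authorship within the same research unit. The Python sketch below shows one way such evidence might be combined. It is not the software's actual implementation: the `Person` record, the helper names, the equal scoring weights, and the sample names and labs are all assumptions made up for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Person:
    """One row of a hypothetical personnel list maintained by the user."""
    name_cn: str                          # name in Chinese characters
    lab: str                              # laboratory or department
    pinyin: tuple[str, tuple[str, ...]]   # (surname, given-name syllables)
    keywords: set[str] = field(default_factory=set)   # user-defined unique keywords
    coauthors: set[str] = field(default_factory=set)  # user-defined known co-authors


def normalize(name: str) -> str:
    """Lower-case a name and keep only letters and the comma."""
    return "".join(ch for ch in name.lower() if ch.isalpha() or ch == ",")


def name_variants(surname: str, given: tuple[str, ...]) -> set[str]:
    """A few common spellings of a pinyin name in an author field (assumed list)."""
    full = "".join(given)
    initials = "".join(s[0] for s in given)
    return {
        f"{surname},{full}",      # zhang,xiaoming
        f"{surname},{initials}",  # zhang,xm
        f"{surname},{full[0]}",   # zhang,x
        f"{full},{surname}",      # xiaoming,zhang
    }


def candidates(author: str, people: list[Person]) -> list[Person]:
    """Everyone on the list whose name variants match the author string."""
    key = normalize(author)
    return [p for p in people if key in name_variants(*p.pinyin)]


def identify(author: str, record_authors: list[str], record_text: str,
             people: list[Person]) -> Optional[Person]:
    """Resolve one author string; return None when the evidence is inconclusive."""
    cands = candidates(author, people)
    if len(cands) <= 1:
        return cands[0] if cands else None

    others = [a for a in record_authors if a != author]
    other_keys = {normalize(a) for a in others}
    # Labs of co-authors that match exactly one person on the list.
    coauthor_labs = set()
    for a in others:
        matched = candidates(a, people)
        if len(matched) == 1:
            coauthor_labs.add(matched[0].lab)
    text = record_text.lower()

    def score(p: Person) -> int:
        s = sum(kw.lower() in text for kw in p.keywords)           # unique keywords
        s += len(other_keys & {normalize(c) for c in p.coauthors})  # listed co-authors
        s += int(p.lab in coauthor_labs)                            # same-lab co-author
        return s

    ranked = sorted(cands, key=score, reverse=True)
    if score(ranked[0]) == 0 or score(ranked[0]) == score(ranked[1]):
        return None  # leave ambiguous cases for manual review
    return ranked[0]


# Made-up personnel list: two people share the pinyin name "Zhang, Wei".
people = [
    Person("张伟", "Accelerator Lab", ("zhang", ("wei",)), keywords={"synchrotron"}),
    Person("张维", "Theory Lab", ("zhang", ("wei",)), keywords={"lattice qcd"}),
    Person("李娜", "Theory Lab", ("li", ("na",))),
]
hit = identify("Zhang, W", ["Zhang, W", "Li, N"], "Lattice QCD study", people)
print(hit.name_cn if hit else "undecided")
```

In this made-up example, "Zhang, W" matches two people by name alone; the keyword hit and the same-lab co-author tip the decision. Returning `None` on a tie is a deliberate choice in the sketch so that ambiguous records are left for manual checking instead of being assigned by guesswork.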
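Acknowledgements and funding: This paper is one of the research outputs of "Building Sustainable Intelligence Analysis Service Capacity at the Institute of High Energy Physics, CAS" (project No. 院1105), a subproject of the Chinese Academy of Sciences program on building sustainable intelligence analysis service capacity at its research institutes, and of "Optimized Design of Auxiliary Tools for Subject-Oriented Knowledge Services" (project No. Q1209), a Young Talent Frontier Project of the National Science Library, Chinese Academy of Sciences. Thanks go to Director Yu Runsheng (于润升) of the Documentation and Information Department, Institute of High Energy Physics, CAS, for encouraging and guiding the choice of topic, and to the editorial office of 现代图书情报技术 (New Technology of Library and Information Service) for its repeated professional revision suggestions.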
Full-text PDF download link: http://www.infotech.ac.cn/CN/Y2014/V30/I4/78
Corresponding author: Yu Jian (于健), E-mail: yuj@mail.las.ac.cn