RSS 1.0 Feed
RSS 2.0 Feed
Atom Feed
 

Evaluation of Algorithm Performance on Identifying OA

Antelman, K., Bakkalbasi, N., Goodman, D., Hajjem, C. and Harnad, S. (2005) Evaluation of Algorithm Performance on Identifying OA. Technical Report UNSPECIFIED, North Carolina State University Libraries, North Carolina State University. (Unpublished)

Warning

There is a more recent version of this eprint available. Click here to view it.

Download

[img]
Preview
PDF
177Kb

Abstract

This is a second signal-detection analysis of the accuracy of a robot in detecting open access (OA) articles (by checking by hand how many of the articles the robot tagged OA were really OA, and vice versa). A first analysis, on a smaller sample (Biology: 100 OA, 100 non-OA), had found a detectability (d') of 2.45 and bias of 0.52 (hits 93%, false positives 16%; Biology %OA: 14%; OA citation advantage: 50%). The present analysis on a larger sample (Biology: 272 OA, 272 non-OA) found a detectability of 0.98 and bias of 0.78 (hits 77%, false positives, 41%; Biology %OA: 16%; OA citation advantage: 64%) An analysis in Sociology (177 OA, 177 non-OA) found near-chance detectability (d' = 0.11) and an OA bias of 0.99 (hits, 54%, false alarms, 49%; prior robot estimate Sociology %OA: 23%; present estimate 15%). It was not possible from these data to estimate the Sociology OA citation advantage. CONCLUSIONS: The robot significantly overcodes for OA. In Biology 2002, 40% of identified OA was in fact OA. In Sociology 2000, only 18% of identified OA was in fact OA. Missed OA was lower: 12% in Biology 2002 and 14% in Sociology 2000. The sources of the error are impossible to determine from the present data, since the algorithm did not capture URLs for documents identified as OA. In conclusion, the robot is not yet performing at a desirable level and future work may be needed to determine the causes, and improve the algorithm.

Creators:Kristin Antelman, Nisa Bakkalbasi, David Goodman, Chawki Hajjem, Stevan Harnad
Item Type:Technical Report
Keywords:signal detection analysis, citation analysis, open access, research impact, webmetrics
Research Group:Intelligence, Agents, Multimedia
Deposited On:16 Dec 2005 by Harnad, Stevan
ID Code:11689
Last Modified:11 Nov 2009 12:25
Performance Indicator:EZ~05~01~06

Available versions of this item

Tools

Metadata

Download Statistics

Last month

Last year

Members of ECS may view the download statistics dashboard for this record.

Corrections

ECS staff and postgraduates may modify this record

Available Versions of this Item

  Welcome from Deputy Head of School (Research) Research Prospectus Industrial Partnerships New Research Students Notes for Guidance New Research Students Notes for Guidance
The ECS EPrints Repository supports OAI 2.0 with a base URL of http://eprints.ecs.soton.ac.uk/cgi/oai2

EPrints is free software developed by the University of Southampton to facilitate Open Access to research.
EPrints