Archive for the ‘02: Analysis’ Category
Big data needs Big science
A great article by the Webtrends CEO:
“We are living in “the age of big data, (…) No industry is untouched by big data, which is notably transforming the way social networks work today. However, the key factor that will determine success for companies in this age is not simply big data, but big science. (…)
The World Economic Forum’s report on data equated it with an asset such as gold. Others have declared that data is “the new oil.” But, as with gold or oil, data has no intrinsic value. (…) Gold requires mining and processing before it finds its way into our jewelry (…) Oil requires extraction and refinement before it becomes the gasoline that fuels our vehicles. Likewise, data requires collection, mining and, finally, analysis before we can realize its true value for businesses, governments, and individuals alike.
According to IDC, the amount of data that companies are wrestling with is growing at 50 percent per year — or more than doubling every two years. Many organizations are rich in data but poor in insight. That’s where big science comes in. (…)
The collection and mining of massive amounts of digital data currently defines the term big data. Those are processes that businesses largely handle. However, the analysis of that data — that magic ingredient of algorithms and advanced mathematics that bridges the gap between knowledge and insight — is big science. It is where the value is. It is the future. (…)
Put simply, the analysis that big science brings to the table makes big data relevant. I envision big science combining with big data to create big opportunities in three significant ways: real-time relevant content, data visualization, and predictive analytics. (…)
Read the full article at:
Yoder, Alex. Big data is worth nothing without Big science. C/Net, May 15, 2012. Available from: http://news.cnet.com/8301-1001_3-57434736-92/big-data-is-worth-nothing-without-big-science/ [Accessed on 15th of May 2012]
Journal article mining and semantic search: practices and promises
An impressive study research, sponsored by the Publishing Research Consortium: includes interviews of key-people from Pfizer, the CERN, Mendeley, the British Library, from TEMIS, Elsevier, Springer, Nature, Wiley, etc.
Journal Article Mining: a research study into Practices, Policies, Plans …..and Promises Eefke Smit and Maurits van der Graaf. PRC June 2011 153pp. This is a study commissioned by PRC which offers the first comprehensive look at what publishers and others are doing, and plan to do, in both data and text mining of the scholarly, mainly journal, literature. Lots of fascinating detail from a number of viewpoints – from 29 interviews and 190 detailed responses to a survey
What impact are your online resources having? calculate your ROI
This toolkit is designed and offered by the JISC to provide a guide to measuring the impacts of online scholarly resources.
It 1will help content creators, publishers and other information professionals understand the reach of their digital assets.
They can use the kit to help guide them through different aspects of measuring impact, both qualitative, such as focus groups, and quantitative, such as web metrics.
Bibliometrics can be fun!
With his world map of scientific collaboration, Oliver Beauchesne, from the US-Canada based Science-Metrix, has built nice visualizaion of science collaboration.
It is based on Scopus data. For those interested in looking at how scientists are connected geographically, a number of companies already promise to help map the geographic reach of an individual or a discipline. These include Springer’s AuthorMapper, Transinsight’s GoPubMed and BioMedExperts.
Van Noorden, Richard. Picture post: world map of scientific collaboration. The Great Beyond (Nature), Posted on January 27,2011.
http://blogs.nature.com/news/thegreatbeyond/2011/01/picture_post_world_map_of_scie_1.html

BioSumm: a summarizer for biology-related texts
BioSumm is a flexible and modular framework which analyzes large collections of unclassified biomedical texts and produces ad hoc summaries oriented to biological information.
BioSumm is neither a traditional summarizer nor a extractor of dictionary terms. It is designed to be a summarizer oriented to the biological domain. Thus, its summaries have both the expressive power of the traditional summaries and the domain specificity of documents produced by a dictionary entry extractor.
The GUI interface is freely downloadable:
https://dbdmg.polito.it/twiki/bin/view/Public/BioSumm
Update (31/01/2011): New address:
http://dbdmg.polito.it/wordpress/research/bioinformatics/biosumm/
Real-time science!
STM publisher Springer has announced the launch of a new free analytics tool, http://realtime.springer.com, which provides multiple visualisations of the usage that is generated worldwide by springer’s online products, including journals, books, images and protocols.
Realtime.springer.com aggregates the raw data on downloads of Springer journal articles and book chapters in real time from all over the world, and displays them in a variety of interactive visualisations such as:
- a map showing where the downloads are coming from
- a constantly updating keyword tag cloud
- and a visualisation of total downloads.
In addition, a search feature shows a chart of the downloads and the ‘Top Five Most Downloaded’ list for every journal or book.
The results provide book authors and journal editors with information on how intensively their content is used.
They gain insight into what topics are trending at the moment, and which areas of the world are currently looking at what type of topics in Springer books and journals.
Librarians get a clear overview of where Springer content is used in the many fields.
Realtime.springer.com currently receives input from the information platform SpringerLink with nearly five million documents from about 41,000 eBooks, 1,160 book series, 2,524 journals and 173 eReference works. Additionally, the tool receives feeds from the SpringerImages database with more than 2.7 million images and from SpringerProtocols, the database of reproducible laboratory protocols in the life and biomedical sciences
Visualisation tools conference with Microsoft, Elsevier, etc. speakers
The International Council for Scientific & Technical Information (ICSTI) has announced that its Winter Meeting will be held from February 6-7, 2011, and will be followed by a Workshop, the theme for which will be ‘Multimedia and Visualisation Innovations for Science’, on February 8, 2010.
The Workshop will be open to both members and non-members. Both events will be hosted by Microsoft, an ICSTI member, on their Redmond campus, WA, USA.
Multimedia and visualisation tools and technology are seen to offer tremendous opportunity for accelerating scientific discovery.
This workshop will feature leading-edge innovations in science-oriented web multimedia, large-scale data exploration and visualisation, speech and object recognition, image indexing and analysis, human/computer interaction and virtual environments, among other topics.
Presentations will be made by technology, science, and information professionals across the broad spectrum of academia, government, business, and industry.
visit http://www.icsti.org
Solutions to Information overload
The solution to data overload is to provide decision makers with “Intelligent Information” – better organised and structured information rapidly conveyed to the users’ preferred devices, says Thomson-Reuters experts in a new study.
In todays’ times, as info pros are overwhelmed by exploding data volumes, they do tend to employ an overly intuitive decision-making style when faced with unorganised information.
Thomson InCites: new release
Via one convenient, web-based platform, InCites delivers objective measures of institutional research performance allowing professionals to make strategic choices to effectively further their research, budgetary, hiring, and market positioning goals.
InCites now creates institution and author profile reports that provide top-level snapshots of performance and gives users the ability to dive into core data for deep analysis.
It also supports customization of the data and flexible tools to manage, refine, share, and save reports.
Now InCites users can easily create a picture of their research output and impact at the author and department level, as well as the institution level available in previous versions.
Press release: http://thomsonreuters.com/content/press_room/sci/New-Version-InCites
Web site:
http://incites.thomsonreuters.com/

