
Geospatial data and Scholia


Finn Årup Nielsen, Daniel Mietchen, Egon Willighagen

Cognitive Systems, DTU Compute, Technical University of Denmark;

Data Science Institute, University of Virginia; Dept of Bioinformatics - BiGCaT, NUTRIM, Maastricht University

3 June 2018


How much geospatial data do we have?

5,855,337 Wikidata geocoordinate links according to the query:

SELECT (COUNT(*) AS ?cnt) WHERE { [] wdt:P625 [] . }

From Chinese villages, Dutch roads, Danish restaurants, ...

The numbers: around 48 million Wikidata items, over 5 billion triples, over 13 million DOI links, and around 39 thousand geolocatable topics of works with a DOI:

SELECT (COUNT(*) AS ?count) WHERE { [] wdt:P356 [] ; wdt:P921 / wdt:P625 [] }
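As a minimal sketch of how such a count query could be sent to WDQS, the snippet below only builds the request URL and does not perform any network call. It assumes the public endpoint https://query.wikidata.org/sparql and its `query` and `format` URL parameters; the helper name `build_wdqs_url` is hypothetical.

```python
from urllib.parse import urlencode

# Assumption: the public WDQS SPARQL endpoint.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"

# Count query for items with a coordinate location (P625), as on the slide.
COUNT_GEO_QUERY = "SELECT (COUNT(*) AS ?cnt) WHERE { [] wdt:P625 [] . }"


def build_wdqs_url(query: str) -> str:
    """Return a GET URL that asks WDQS for JSON results (hypothetical helper)."""
    return WDQS_ENDPOINT + "?" + urlencode({"query": query, "format": "json"})


url = build_wdqs_url(COUNT_GEO_QUERY)
print(url)
```

The resulting URL can be opened with any HTTP client; the counts returned today will differ from the 2018 figures quoted above.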


Application of Wikidata geospatial data

There are several applications using geospatial data from Wikidata.

Magnus Manske has produced Reasonator, which displays a map for a specific Wikidata item, and Wikishootme, which shows a map of geolocatable Wikidata items missing an image (see screenshot).

You can also discover Wikidata items near you with the special page https://www.wikidata.org/wiki/Special:Nearby in the MediaWiki software.


Scholia

Scholia is a web service running at https://tools.wmflabs.org/scholia/

It displays information from Wikidata about researchers, works and their citations, organizations, venues, events, topics, etc.

Panels for each Wikidata item are constructed with calls to the Wikidata Query Service (WDQS), showing tables and plots such as bubble charts and line plots as well as OpenStreetMap-based maps.


Geospatial data with Scholia: topic

Maps established with simple queries to WDQS.

Find works about a topic (here Mayaro virus) with a SPARQL path query:

?work wdt:P921 /
  ( wdt:P31*/wdt:P279*
  | wdt:P361+ | wdt:P1269+ ) wd:Q18863953 .

Identify a co-occurring topic that is geolocatable:

?work wdt:P921 ?location .

?location wdt:P625 ?geo .
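The two fragments above could plausibly be combined into one query per topic. The sketch below assembles such a query as a string; the SELECT clause, the way the fragments are joined, and the function name `topic_map_query` are assumptions for illustration, not the exact query used by Scholia.

```python
def topic_map_query(topic_qid: str) -> str:
    """Build a SPARQL query for geolocatable co-occurring topics (sketch)."""
    # Guard against malformed identifiers before interpolating into the query.
    if not (topic_qid.startswith("Q") and topic_qid[1:].isdigit()):
        raise ValueError(f"not a Wikidata item identifier: {topic_qid!r}")
    return f"""SELECT ?work ?location ?geo WHERE {{
  # Works whose main subject (P921) is the topic, an instance or subclass
  # of it, a part of it (P361), or a facet of it (P1269).
  ?work wdt:P921 / (wdt:P31*/wdt:P279* | wdt:P361+ | wdt:P1269+) wd:{topic_qid} .
  # Another main subject of the same work that has coordinates (P625).
  ?work wdt:P921 ?location .
  ?location wdt:P625 ?geo .
}}"""


# Mayaro virus, the topic used on the slide.
print(topic_map_query("Q18863953"))
```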


Geospatial data with Scholia: author

Author aspect: /scholia/author/Q20980928

With node coloring controlled by the type of property.


New Scholia aspects: location and country


Geospatial data with Scholia: location

Scholia location aspect for a Cretan hotel:

/location/Q47259960

A SPARQL query using the distance function geof:distance shows nearby academic institutions.

Other panel: Nearby locations as topics in works. Identifies, e.g., Tomb Robbing and the Transformation of Social Memory in Roman Knossos (Grigoropulous, 2004) as an article with a nearby topic.


Geospatial data with Scholia: location

SPARQL query for identifying nearby academic institutions

# Academic institution
VALUES ?university { wd:Q3918 wd:Q1371037 wd:Q7315155 wd:Q31855 }
...
# Find individual universities and departments
# and the geocoordinate
?organization wdt:P361* / wdt:P31 / wdt:P279* ?university .
?organization wdt:P625 ?other_geo .
...
# Compute distance between academic institution
# and the query location
wd:Q47259960 wdt:P625 ?geo .
BIND(geof:distance(?other_geo, ?geo) AS ?distance)
FILTER(?distance < 250)
...
ORDER BY ?distance
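To illustrate the distance filter outside SPARQL, here is a minimal Python sketch. It assumes that geof:distance yields a great-circle distance in kilometres and approximates it with the haversine formula on a spherical Earth; the function names and the sample coordinates are hypothetical and not taken from Wikidata.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; spherical approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearby(origin, places, max_km=250.0):
    """Mimic FILTER(?distance < 250) followed by ORDER BY ?distance."""
    hits = []
    for name, (lat, lon) in places.items():
        distance = haversine_km(origin[0], origin[1], lat, lon)
        if distance < max_km:
            hits.append((name, distance))
    return sorted(hits, key=lambda item: item[1])


# Made-up coordinates, for illustration only.
places = {"a": (0.0, 1.0), "b": (0.0, 0.5), "c": (0.0, 10.0)}
for name, distance in nearby((0.0, 0.0), places):
    print(name, round(distance, 1))
```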


Countries in Scholia

Map in Scholia with international collaborators of authors based in the Netherlands: /scholia/country/Q55

Other map panels for the country aspect display narrative locations within works and locations as topics in works.


User stories


User story: Finnish machine learning

You are to review research applications from Finland about machine learning and related research fields. You are based outside Finland and would like to get an overview of Finnish researchers and research organizations in that research area, their works as well as their collaboration and citation patterns.


User story: Finnish machine learning

One of the panels on the Finnish machine learning country–topic aspect page displays the co-author graph.

It uses item links between researchers, affiliations and countries to identify Finnish researchers, with no need to query geocoordinate data.

Combination of country and topic: /scholia/country/Q33/topic/Q2539


User story: Wikipedia researchers in Tübingen

You are a researcher interested in Wikipedia research and planning a visit to Tübingen where you would like to meet other Wikipedia researchers.


User story: Wikipedia researchers in Tübingen

Combination of location and topic: /scholia/location/Q3806/topic/Q52.

(SUM(?topic_score) * MAX(?inverse_distance) AS ?score)
...
?work wdt:P921 / wdt:P279* wd:Q52 . BIND(3 AS ?topic_score) }
UNION
{ ?author wdt:P101 wd:Q52 . BIND(20 AS ?topic_score) }
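The aggregation in the SELECT clause can be sketched in Python as follows. The grouping variable is elided on the slide, so grouping by author is an assumption here, as are the function name `rank_researchers` and the sample rows; the inverse distance is taken as a given input because its definition is not shown for this query.

```python
from collections import defaultdict


def rank_researchers(rows):
    """Score = SUM(topic_score) * MAX(inverse_distance), per author.

    rows: iterable of (author, topic_score, inverse_distance) tuples,
    one per query solution (hypothetical input format).
    """
    total_topic_score = defaultdict(float)
    best_inverse_distance = defaultdict(float)
    for author, topic_score, inverse_distance in rows:
        total_topic_score[author] += topic_score
        best_inverse_distance[author] = max(
            best_inverse_distance[author], inverse_distance)
    scores = {author: total_topic_score[author] * best_inverse_distance[author]
              for author in total_topic_score}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Made-up rows: 3 points per work on the topic, 20 for a field-of-work match.
rows = [("A", 3, 2.0), ("A", 3, 1.0), ("A", 20, 2.0), ("B", 3, 10.0)]
print(rank_researchers(rows))
```

With this weighting a declared field of work counts as much as several topical works, while a single nearby affiliation is enough to lift the whole score.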


User story: conference hunter

You are going to The Web Conference in April 2018 in Lyon. You want to know if there is any other relevant scientific meeting in the local area at that time, preferably just before or just after the conference.


User story: conference hunter

Related events panel for The Web Conference 2018 in Scholia.


User story: conference hunter

Event aspect for The Web Conference 2018: /scholia/event/Q48910401 where the SPARQL query combines inverse distance and inverse time separation:

BIND(20 / (1 + ABS(?day - ?day0)) AS ?time_score_)
...
# inverse distance
BIND((200 / (1 + geof:distance(?geo, ?geo0))) AS ?inverse_distance)
...
BIND((?time_score_ * ?location_score_) AS ?score_)
...
ORDER BY DESC(?score)
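The scoring in this query can be mirrored in a few lines of Python. This is a sketch under assumptions: days are treated as plain day numbers, distances as kilometres, and the elided steps between ?inverse_distance and ?location_score_ are skipped by using the inverse distance directly as the location score. The function names and sample events are hypothetical.

```python
def time_score(day, day0):
    """20 / (1 + |day - day0|): highest when the events share a day."""
    return 20 / (1 + abs(day - day0))


def inverse_distance(distance_km):
    """200 / (1 + distance): highest when the events share a place."""
    return 200 / (1 + distance_km)


def event_score(day, day0, distance_km):
    """Product of the time score and the distance-based score."""
    return time_score(day, day0) * inverse_distance(distance_km)


def rank_events(events, day0):
    """events: (name, day, distance_km) tuples; best match first."""
    scored = [(name, event_score(day, day0, distance_km))
              for name, day, distance_km in events]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Made-up candidate events relative to a reference event on day 100.
events = [
    ("two days later, 199 km away", 102, 199.0),
    ("same day, same place", 100, 0.0),
    ("same day, 9 km away", 100, 9.0),
]
for name, score in rank_events(events, day0=100):
    print(f"{score:8.2f}  {name}")
```

Because the two factors are multiplied, an event must be close in both time and space to rank highly; a large separation in either dimension pulls the score towards zero.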


Summary

Large amount of geospatial data in Wikidata, including geospatial data tied to scientific items (articles, researchers, organizations).

... and continuous expansion of the data.

WDQS makes it easy to create maps of the data in Wikidata.

Scholia uses the capabilities of WDQS to render maps and compute distances for a range of different scholar-associated data.


References

Grigoropulous, D. (2004). Tomb Robbing and the Transformation of Social Memory in Roman Knossos, pages 62–77.


Copyright and license

OpenStreetMap maps are Map © OpenStreetMap contributors, CC BY-SA 2.0.

Wikidata logo by Arun Ganesh (Planemad). It is a trademark of the Wikimedia Foundation.

Wikishootme is the work of Magnus Manske.
