{"id":2722,"date":"2019-10-14T10:20:00","date_gmt":"2019-10-14T07:20:00","guid":{"rendered":"https:\/\/dh.org.ee\/?page_id=2722"},"modified":"2019-10-14T10:20:00","modified_gmt":"2019-10-14T07:20:00","slug":"workshops","status":"publish","type":"page","link":"https:\/\/dh.org.ee\/en\/events\/dhe2019\/abstracts\/workshops\/","title":{"rendered":"Workshops"},"content":{"rendered":"\n<h3 class=\"wp-block-heading\">A hands-on data exploration\u00a0&amp; challenge to become a derived data-set author on the British Library\u2019s open data-set platform (<a rel=\"noreferrer noopener\" href=\"https:\/\/data.bl.uk\/\" target=\"_blank\">data.bl.uk<\/a>)<\/h3>\n\n\n\n<p><em>Speaker: Mahendra Mahey<\/em><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Do      you want to understand some of the challenges of working with cultural      heritage data in a large&nbsp;national library such as the British      Library?<\/li><li>Do      you want to explore and get some &#8216;hands-on&#8217; experience of working with the      British Library\u2019s digital collections and data?<\/li><li>Do      you want to leave a \u2018legacy\u2019 of being a data-set author\/creator\/curator on      the British Library\u2019s data-set platform?<\/li><li>Do      you have&nbsp;<strong>some digital&nbsp;literacy in using familiar data      exploration tools<\/strong>&nbsp;such as Microsoft Excel (see &#8216;<strong>GUIDANCE FOR THIS WORKSHOP<\/strong>&#8216; below)?<\/li><\/ul>\n\n\n\n<p><strong>If the answer is &#8216;Yes&#8217; to any of these,\nthen this workshop&nbsp;could be for you!<\/strong><\/p>\n\n\n\n<p><a href=\"http:\/\/dha-2018.p.dha2018.currinda.com\/speaker\/207\" target=\"_blank\" rel=\"noreferrer noopener\">Mahendra Mahey<\/a>, manager of&nbsp;<a href=\"https:\/\/www.bl.uk\/projects\/british-library-labs\" target=\"_blank\" rel=\"noreferrer noopener\">British Library Labs<\/a>&nbsp;(<a href=\"https:\/\/www.bl.uk\/projects\/british-library-labs\" target=\"_blank\" rel=\"noreferrer noopener\">BL Labs<\/a>)&nbsp;will examine some of the BL\u2019s&nbsp;<a href=\"https:\/\/data.bl.uk\/\" target=\"_blank\" rel=\"noreferrer noopener\">digital collections\/data<\/a>&nbsp;&amp; discuss challenges&nbsp;he\nhas&nbsp;had in making the BL&#8217;s cultural heritage data available openly or\nonsite at the British Library.<\/p>\n\n\n\n<p>Mahendra will invite delegates to explore data-sets at their leisure, setting a challenge for those who are interested, skilled in exploring, finding patterns and grouping\u00a0data. They could become data-set authors\/creators of derived data-sets, based on pre-existing digital collections\/data provided on the day or already available on\u00a0<a rel=\"noreferrer noopener\" href=\"https:\/\/data.bl.uk\/\" target=\"_blank\">data.bl.uk<\/a>.\u00a0<\/p>\n\n\n\n<p>The workshop will conclude with reflections from the delegates and possibly highlighting\u00a0a number derived data-sets that were generated by participants on the day that could now potentially exist on\u00a0<a rel=\"noreferrer noopener\" href=\"https:\/\/data.bl.uk\/\" target=\"_blank\">data.bl.uk<\/a>.\u00a0If selected, these\u00a0new derived data-sets will be attributed with the creators&#8217; \/ authors&#8217; details and each\u00a0will have its own cite-able Digital Object Identifier (D.O.I). These new data-sets would then be available for reuse by any researcher in the world.<\/p>\n\n\n\n<p><strong>GUIDANCE FOR THIS WORKSHOP<\/strong><\/p>\n\n\n\n<p>We&nbsp;<strong>strongly recommend<\/strong>&nbsp;you\ncome to this workshop with an appropriate device such as a&nbsp;<strong>laptop&nbsp;<\/strong>pre-installed\nwith appropriate tools to&nbsp;analayse different kinds of data-sets, e.g.\nMicrosoft Excel may work with smaller data-sets such as metadata (see other\ndata exploration tools below). If you don&#8217;t have one, and would still like to\nattend, please request to &#8216;pair up&#8217; with someone who is willing to share\nand&nbsp;has already signed up.<\/p>\n\n\n\n<p><strong>Other data exploration tools\ninclude:&nbsp;<\/strong><a href=\"https:\/\/notepad-plus-plus.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">Notepad++<\/a>&nbsp;(e.g. for viewing text and\nXML);&nbsp;<a href=\"http:\/\/openrefine.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">Open\nRefine<\/a>&nbsp;(e.g.\nfor cleaning data);&nbsp;<a href=\"https:\/\/public.tableau.com\/en-us\/s\/\" target=\"_blank\" rel=\"noreferrer noopener\">Tableau Public<\/a>&nbsp;(e.g. for visualising data);&nbsp;<a href=\"https:\/\/geoawesomeness.com\/make-awesome-interactive-map-using-google-sheets-1-minute\/\" target=\"_blank\" rel=\"noreferrer noopener\">Google Sheets<\/a>&nbsp;(e.g for visualising geo-spatial\ndata);&nbsp;<a href=\"https:\/\/spacy.io\/\" target=\"_blank\" rel=\"noreferrer noopener\">Spacy<\/a>&nbsp;(e.g. for text and data\nmining),&nbsp;<a href=\"https:\/\/www.rstudio.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">RStudio<\/a>&nbsp;(an open source Statistical package),&nbsp;<a href=\"https:\/\/www.mathworks.com\/products\/matlab.html\" target=\"_blank\" rel=\"noreferrer noopener\">MATLAB<\/a>&nbsp;(data analysis tool)&nbsp;&amp;&nbsp;<a href=\"https:\/\/www.nltk.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">NLTK<\/a>&nbsp;(Natural Language processing).&nbsp;<\/p>\n\n\n\n<p><strong>Please note that this workshop is&nbsp;NOT\nabout training you in using any of these tools, just tools you may be already\nfamiliar with to explore and&nbsp;find patterns in&nbsp;our data.<\/strong><\/p>\n\n\n\n<p>Datatypes you may be examining in this\nworkshop could include:&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Zip_(file_format)\" target=\"_blank\" rel=\"noreferrer noopener\">.ZIP<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/PDF\" target=\"_blank\" rel=\"noreferrer noopener\">.PDF<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Text_file\" target=\"_blank\" rel=\"noreferrer noopener\">.TXT<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Comma-separated_values\" target=\"_blank\" rel=\"noreferrer noopener\">.CSV<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/TSV\" target=\"_blank\" rel=\"noreferrer noopener\">.TSV<\/a>.&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Microsoft_Excel_file_format\" target=\"_blank\" rel=\"noreferrer noopener\">.XLS<\/a>,&nbsp;.<a href=\"http:\/\/whatis.techtarget.com\/fileformat\/XLSX-Microsoft-Excel-Open-XML-Document\" target=\"_blank\" rel=\"noreferrer noopener\">XLSX<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Resource_Description_Framework\" target=\"_blank\" rel=\"noreferrer noopener\">RDF<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/N-Triples\" target=\"_blank\" rel=\"noreferrer noopener\">.nt<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/XML\" target=\"_blank\" rel=\"noreferrer noopener\">XML<\/a>&nbsp;(<a href=\"https:\/\/en.wikipedia.org\/wiki\/Text_Encoding_Initiative\" target=\"_blank\" rel=\"noreferrer noopener\">TEI<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/ALTO_(XML)\" target=\"_blank\" rel=\"noreferrer noopener\">ALTO<\/a>&nbsp;and bespoke),&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/JSON\" target=\"_blank\" rel=\"noreferrer noopener\">.JSON<\/a>,&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/JPEG\" target=\"_blank\" rel=\"noreferrer noopener\">.JPG,&nbsp;.JPEG<\/a>,&nbsp;.<a href=\"https:\/\/en.wikipedia.org\/wiki\/TIFF\" target=\"_blank\" rel=\"noreferrer noopener\">TIFF<\/a>&nbsp;and&nbsp;.<a href=\"https:\/\/en.wikipedia.org\/wiki\/Web_ARChive\" target=\"_blank\" rel=\"noreferrer noopener\">WARC<\/a><\/p>\n\n\n\n<p>Please ensure you are able to read these\nfiles on your device&nbsp;<strong>before the workshop<\/strong>&nbsp;if you are interested\nin exploring them during our session.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Cadaver.exe (Forays into New Digital Poetry) <\/h3>\n\n\n\n<p><em>Speaker: Justice (Ruby) Th\u00e9lot<\/em><\/p>\n\n\n\n<p>New ideas exist in front of us, in the web of context, they are dormant and we need to unearth them. Borrowing from the practices of Surrealist and Automatist writers and artists, this workshop seeks to utilize the collective unconscious (accessed through the &#8220;feed&#8221;) in order to come up with funny, exciting, new, associations. What we will be doing basically is a digital exquisite cadaver with our timelines. Copying and posting screen-grabs onto a digital canvas at random. We will utilize our own likes, follows and cookies to pierce through and bring forth the unconscious language of the timeline.<\/p>\n\n\n\n<p>\nParticipants\nwill come out of the workshop with digital collages, poems,\npot-pourris of posts which they can assemble into mini-zines, digital\nor even analog by printing them. The goal being to broaden the scope\nof what can be considered as &#8220;digital literature&#8221;.<\/p>\n\n\n\n<p>\nI\nwant participants to PLAY! An essential part of Dadaist philosophy\nwas indeed the notion that we could bring a child like energy and fun\nto art, poetry and writing. This workshop is sure to create uncanny\njuxtapositions and engender laughter in the group.<\/p>\n\n\n\n<p>\nThis\nis essentially an exercise in new-automatic-writing. We are composing\nnew texts (poems) or collage from posts that already exist. Scrolling\nhas become a modern ritual. We do it religiously 5 times a day, at\ndusk and at dawn. We will unearth the unspoken text of our timelines,\nwe will see our activity of scrolling under a new lens and\neffectively we shall transform this often passive looking process\n(looking at the feed) into an active creative process (making\nsomething with the feed).<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Python for Digital Humanities <\/h3>\n\n\n\n<p><em>Speaker: Sree Ganesh Thotempudi<\/em><\/p>\n\n\n\n<p>The first part of the workshop will be spent learning the basics of the Python programming language. We will start from the assumption that the students have never used Python and move them through the basics of the language. The second part will focus on using these newly gained programming skills to automatically manipulate XML data. Using the lxml and Beautiful Soup Python libraries, we will take data from the Perseus repository and convert the TEI-XML in the repository to CTS-compliant XML and then feed the data back into the Perseus repository. In this second part, the students will design and implement their own XML manipulation pipeline that they will then be able to use later to automatically manipulate or convert large corpora of XML texts from one format to another.<\/p>\n\n\n\n<p>\nBesides\nthe basic skills in Python programming, the students will also learn\nto manipulate text using regular expressions, to expand the\ncapabilities of Python by installing external libraries, and to use\nPython to make basic API calls, using the Perseus API as an example\ncase. We expect you to have some familiarity XML and, preferably,\nsome experience with a scripting or programming language. We also\nexpect you to have either a Linux or Mac computer. Any Windows users\nwill be expected to install a Linux virtual machine before the start\nof the workshop.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Accessing texts and data in the collections of the National Library of Estonia <\/h3>\n\n\n\n<p><em>Speaker: Peeter Tinits<\/em><\/p>\n\n\n\n<p>The workshop presents a practical introduction to the use of textual data and metadata within the collections of the National Library of Estonia.<\/p>\n\n\n\n<p>\nFor\nexample, the collections currently house a digital collection of\naround ~3.4M pages (~6.0M articles) from ~2,000 periodicals\n(1821&#8230;2019) and ~25,000 book-length publications as well as a\nmetadata registry for ~300,000 printed publications published in\nEstonian or in connection to Estonia among other sources. These\nmaterials are the result of decades of digitization and data\ncollection, while steps have always been taken to make the\ncollections useful also for researchers. With the developments of the\ntechnological toolkit of a researcher in social sciences and digital\nhumanities, these collections can find new value also within the\nresearch communities.<\/p>\n\n\n\n<p>\nThe\nworkshop will offer a practical introduction to the access of textual\ndata and metadata in the library collections, explain to what degree\nthis can be done and how, and offer some use cases for this. It will\nintroduce ongoing efforts at the library to keep improving this\naccess and discuss also the future plans on this. Particularly, the\nworkshop will look to its participants to understand the features\nthat would make it most useful for researchers, and interesting for\nthe general public. The participant should expect to walk away with\nsome practical understanding of how the data can be accessed as well\nas ideas on what this could be used for.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Language Resources for Content Search and Metadata Search <\/h3>\n\n\n\n<p><em>Speakers: Kadri Vider, Neeme Kahusk, Olga Gerassimenko<\/em><\/p>\n\n\n\n<p>The Center of Estonian Language Resources offers a 3-hour tutorial of main types of European language resources for the broader audience of DH researchers that might be interested in language resources usage.<\/p>\n\n\n\n<p>\nDigital\nHumanities researchers (<strong>max\n20 participants<\/strong>)\ninterested in using Language Resources are invited to learn to\nperform content and metadata search in the Estonian and European\nLanguage Resources. We will demonstrate the main types of Language\nResources of <a href=\"https:\/\/www.clarin.eu\/\">CLARIN<\/a>,\nEuropean Research Infrastructure of Language Resources and\nTechnology, and <a href=\"https:\/\/keeleressursid.ee\/en\/\">Center\nof Estonian Language Resources<\/a>\nas CLARIN national center.<\/p>\n\n\n\n<p>\n1.\nKORP is a corpus query system that allows to perform flexible and\nintricate corpus searches based on all the features tagged or\nsystematically appearing in content and metadata. KORP was created in\nG\u00f6teborg University Language Bank <a href=\"https:\/\/spraakbanken.gu.se\/eng\/korp\">Spr\u00e5kbanken<\/a>\nand is being developed in several other countries besides Sweden and\n<a href=\"https:\/\/korp.keeleressursid.ee\/\">Estonia<\/a>:\nFinland Language Bank <a href=\"https:\/\/korp.csc.fi\/\">Kielipankki<\/a>,\nNorway Centre for Saami language technology <a href=\"http:\/\/gtweb.uit.no\/korp\/\">Giellatekno<\/a>,\n<a href=\"https:\/\/alf.hum.ku.dk\/korp\/\">Denmark<\/a>,\n<a href=\"http:\/\/malheildir.arnastofnun.is\/\">Iceland<\/a>.<\/p>\n\n\n\n<p>Tutorial (90 minutes) will teach to use the corpora found in Estonian and Finnish KORP, to make best use of simple, extended and advanced search interface, to export the results for the work with statistics programs. We will demonstrate the restricted access (text and speech) corpora that are available to the academical users through a Single Sign-On technology.<\/p>\n\n\n\n<p>2. <a href=\"https:\/\/raba.keeleressursid.ee\/\">RABA<\/a> is an Estonian Federated Content Search system that uses both text and speech corpora and lexical resources such as dictionaries to perform quick and efficient content search in differently annotated differently organised resources.<\/p>\n\n\n\n<p>\nTutorial\n(60 minutes) will teach to use simple and advanced query interfaces\nand make use of different data collections included in the content\nsearch. We will also demonstrate the usage of CLARIN <a href=\"https:\/\/spraakbanken.gu.se\/ws\/fcs\/2.0\/aggregator\/\">Federated\nContent Search<\/a>\nthat covers other CLARIN data resources and collections.<\/p>\n\n\n\n<p>3. <a href=\"https:\/\/vlo.clarin.eu\/;jsessionid=C20834457A364710329162FAB889DEF4?0\">Virtual Language Observatory<\/a> is the most comprehensive CLARIN browser search system built to automatically harvest language resources, tools and services of CLARIN centers and to explore the large number of resources from various domains and providers in a uniform way.<\/p>\n\n\n\n<p>Tutorial (30 minutes) will show the possibilities of narrowing and broadening the search in VLO to find specific resources or similar resources. We will demonstrate the smart usage of search facets and visualized uniform information about resources. Estonian register <a href=\"https:\/\/metashare.ut.ee\/\">META-SHARE<\/a> that provides VLO with data will also be demonstrated and taught in order to register the new resources that the participants of the tutorial might wish to add to the database.<\/p>\n\n\n\n<p>The tutorial will be held in English with a focus on the Estonian resources but the knowledge of Estonian is not required to participate. The participants would need to bring their own laptops. Stable internet connection is necessary.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A hands-on data exploration\u00a0&amp; challenge to become a derived data-set author on the British Library\u2019s open data-set platform (data.bl.uk) Speaker: Mahendra Mahey Do you want to understand some of the challenges of working with cultural heritage data in a large&nbsp;national library such as the British Library? Do you want to explore and get some &#8216;hands-on&#8217; &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/dh.org.ee\/en\/events\/dhe2019\/abstracts\/workshops\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Workshops&#8221;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"parent":3169,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-2722","page","type-page","status-publish","hentry"],"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/pages\/2722","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/comments?post=2722"}],"version-history":[{"count":0,"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/pages\/2722\/revisions"}],"up":[{"embeddable":true,"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/pages\/3169"}],"wp:attachment":[{"href":"https:\/\/dh.org.ee\/en\/wp-json\/wp\/v2\/media?parent=2722"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}