{"id":2417,"date":"2017-01-05T15:05:22","date_gmt":"2017-01-05T14:05:22","guid":{"rendered":"http:\/\/blogit.itu.dk\/ethos\/?p=2417"},"modified":"2017-02-23T10:01:30","modified_gmt":"2017-02-23T09:01:30","slug":"mining-with-hyphe-reflections-on-publicethos-13","status":"publish","type":"post","link":"https:\/\/blogit.itu.dk\/ethoslab\/2017\/01\/05\/mining-with-hyphe-reflections-on-publicethos-13\/","title":{"rendered":"Mining with Hyphe"},"content":{"rendered":"<p><em>Reflections on publicETHOS 13# written by Mace Ojala.&nbsp;<\/em><em>A student, a&nbsp;Software Studies scholar, and&nbsp;all around academic elf.&nbsp;<\/em><em>University of Tampere\/IT University of Copenhagen\/University of Copenhagen.&nbsp;<\/em><\/p>\n<p>A fruitful invitation for us in Science and Technology Studies (STS) to reflect on our engagement&nbsp;with tools&nbsp;was presented in a&nbsp;<a href=\"https:\/\/ethos.itu.dk\/2016\/11\/14\/publicethos-hyphe\/\">publicETHOS&nbsp;workshop on 1.12.2016<\/a>. The afternoon&nbsp;workshop run by&nbsp;Mathieu Jacomy&nbsp;(<a href=\"https:\/\/twitter.com\/jacomyma\" target=\"_blank\">@jacomyma<\/a>)&nbsp;was a walkthrough of web scraping with&nbsp;<a href=\"http:\/\/tools.medialab.sciences-po.fr\/#hyphe\">Hyphe<\/a>, an open-source, free&nbsp;tool developed at&nbsp;<a href=\"http:\/\/www.medialab.sciences-po.fr\/\">Sciences Po m\u00e9dialab<\/a>&nbsp;where Jacomy works. The m\u00e9dialab designs and build tools for social scientists in particular, Hyphe being one of them. What does Hyphe propose as a tool, and what would it take for it to be entrenched in an academic community like the&nbsp;ITU?<\/p>\n<p><a href=\"https:\/\/pure.itu.dk\/portal\/en\/persons\/marie-bloend(08218d20-6a84-4ae9-94a6-ef4ab25984e7).html\">Marie Bl\u00f8nd<\/a>&nbsp;(<a href=\"https:\/\/twitter.com\/ITU_Marie\" target=\"_blank\">@ITU_Marie<\/a>), lab manager of&nbsp;ETHOS Lab&nbsp;and a graduate of the&nbsp;Digital Innovations Management&nbsp;(DIM) says Hyphe has developed since she&nbsp;learned&nbsp;about it on the Navigating Complexity course during her studies. Compared with other tools such as&nbsp;<a href=\"http:\/\/www.tableau.com\/\">Tableau<\/a>,&nbsp;Hyphe feels less like a black box. The difference stems from iterative,&nbsp;reflexive engagement with the data that Hyphe affords. Seen as a web search, an alternative to results-oriented&nbsp;tools such as the ubiquitous&nbsp;Google&nbsp;or even&nbsp;REX, Hyphe postpones results until the laborious work of corpus curation has been engaged with. Instead of querying a pre-given set of webpages or sites, the workflow starts by the researcher&nbsp;explicitly&nbsp;selecting&nbsp;URLs into the set to be scraped, and then crawled for further links.<\/p>\n<p>Superficially speaking, the output of Hyphe is&nbsp;typical force-directed&nbsp;network visualisations, now familiar to all of us and part of the repertoire of so-called&nbsp;digital methods. They attract critique, and rightly so.&nbsp;Marie however argues that the valuable output of Hyphe is reflexive process with the data.&nbsp;Working with&nbsp;corpus curation&nbsp;keeps the data and the&nbsp;data sources from being hidden behind a surface, and helps verify and analyse them qualitatively. The removal of&nbsp;automatic crawling of \u201call the web\u201d is where the heart of Hyphe\u2019s value proposition seems to be.<\/p>\n<p>Other than visualising the&nbsp;hyperlinks between web entities, the graph can be exported to&nbsp;<a href=\"http:\/\/gephi.org\/\">Gephi<\/a>&nbsp;or other tools&nbsp;for further qualitative or quantitative&nbsp;analysis. Hyphe thus sits as a tool among others, to be assembled into the&nbsp;instrumentation of a given research. As such,&nbsp;Hyphe is one&nbsp;implementation of the kind of tool to investigate that&nbsp;<a href=\"http:\/\/www.uva.nl\/over-de-uva\/organisatie\/medewerkers\/content\/r\/o\/r.a.rogers\/r.a.rogers.html\">Richard Rogers<\/a>&nbsp;(<a href=\"https:\/\/twitter.com\/richardrogers\" target=\"_blank\">@richardrogers<\/a>)&nbsp;from&nbsp;<a href=\"https:\/\/wiki.digitalmethods.net\/Dmi\/DmiAbout\">Digital Methods Initiative<\/a>&nbsp;(DMI&nbsp;<a href=\"https:\/\/twitter.com\/digitalmethods\" target=\"_blank\">@digitalmethods<\/a>) at&nbsp;<a href=\"https:\/\/ethos.itu.dk\/2016\/10\/24\/2238\/\">an earlier publicETHOS<\/a>&nbsp;event characterised as natively digital data. There is tremendous value in the extra design&nbsp;effort that DMI, Sciences Po and others purposefully&nbsp;invest into the&nbsp;software to make them more accessible to the wider research community, as&nbsp;social sciences and humanities alike adopt computational thinking into research. Despite this,&nbsp;the digital\/non-digital remains a complex intertwine. Data, methods, nor tools warrant natural discrimination between one or the other, associate professor&nbsp;<a href=\"https:\/\/pure.itu.dk\/portal\/en\/persons\/luca-rossi(1b45e3f5-b8cd-4c0d-a444-2901304f76ee).html\">Luca Rossi<\/a>&nbsp;(<a href=\"https:\/\/twitter.com\/LR\" target=\"_blank\">@lr<\/a>) from&nbsp;<a href=\"https:\/\/pure.itu.dk\/portal\/en\/organisations\/digital-society-and-communication(eddb0b6e-52f3-4d89-b342-296d320a32cd).html\" target=\"_blank\">Digital Society and Communication<\/a>&nbsp;(DiSCo)&nbsp;section reminds us.<\/p>\n<p>The crucial question when working with Hyphe&nbsp;is of course what to include in&nbsp;the&nbsp;corpus \u2014 which criteria to adopt?&nbsp;<a href=\"https:\/\/pure.itu.dk\/portal\/en\/persons\/lea-schick(aa2cb9d0-d2d3-4377-8540-ea9b98e3b181).html#person-tabs\">Lea Schick<\/a>, (<a href=\"https:\/\/twitter.com\/schicku\" target=\"_blank\">@schicku<\/a>)&nbsp;postdoc at the&nbsp;<a href=\"https:\/\/itu.dk\/tip\/\">Technologies in Practice<\/a>&nbsp;(TiP) research group raised this question to Mathieu at the workshop, who refrained from providing&nbsp;generic answer. Instead, such decisions&nbsp;would necessarily be made from the perspective of particular&nbsp;research undertaken. The afternoon workshop centered around an artificial research scenario of mapping controversies around AIDS.&nbsp;A follow-up workshop where people would bring their own research could be useful, Lea hinted, grounding the Hyphe workflow and discussing specific questions.<\/p>\n<p>I was recently working as a research assistant for <a href=\"https:\/\/pure.itu.dk\/portal\/en\/persons\/marisa-cohn(e9d14210-bc71-4824-b7cf-fd1448d022ab).html\" target=\"_blank\">Marisa Cohn<\/a> at TiP, and we were looking at legacy software, legacyness, decay, discourses,&nbsp;sustenance and&nbsp;maintenance, entanglements and torques of bodies, biographies and softwares, how time plays out in&nbsp;software and that sorts of things. Were we&nbsp;using Hyphe? No, we were not. We had an eclectic corpus we were working with, but the fundamental Hyphe&nbsp;assumption of&nbsp;hyperlinks carrying information did&nbsp;not hold. The research object we collected was hyperlink-poor, consisting of web forum discussions threads, individual&nbsp;blogposts, PDFs, images and other sorts of digital documents which don\u2019t express much&nbsp;hypertextuality. Therefore&nbsp;web crawling and hyperlink analysis did not seem like an appropriate idea. Had we chosen our&nbsp;collection differently, to match the assumptions, Hyphe&nbsp;might have been valuable. Web crawling itself&nbsp;is a non-trivial problem, and accepting some&nbsp;\u201cwhite box\u201d (semi-opaque, interrogatable) instruments would have been welcome, instead of building research automation&nbsp;from scratch. Similarly, had we committed to Hyphe early on, that choice of a tool would surely guided us towards data sources compatible with the tools&nbsp;assumptions, and a different research object would have been created.<\/p>\n<p>\/<a href=\"https:\/\/pure.itu.dk\/portal\/en\/persons\/mace-ojala(72fb4ba1-ee58-4e93-9cdd-4e9855fa1c3e).html\" target=\"_blank\">Mace&nbsp;<\/a><em>(<a href=\"http:\/\/twitter.com\/xmacex\">@xmacex<\/a><\/em><em>)&nbsp;<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Reflections on publicETHOS 13# written by Mace Ojala.&nbsp;A student, a&nbsp;Software Studies scholar, and&nbsp;all around academic elf.&nbsp;University of Tampere\/IT University of Copenhagen\/University of Copenhagen.&nbsp; A fruitful invitation for us in Science [&hellip;]<\/p>\n","protected":false},"author":91,"featured_media":2453,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"","_et_pb_old_content":"","ngg_post_thumbnail":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":true,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[1,51],"tags":[],"class_list":["post-2417","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","category-blog"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/blogit.itu.dk\/ethoslab\/wp-content\/uploads\/sites\/92\/2017\/01\/miningf%C3%A6rdig.png?fit=1056%2C366&ssl=1","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/posts\/2417","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/users\/91"}],"replies":[{"embeddable":true,"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/comments?post=2417"}],"version-history":[{"count":25,"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/posts\/2417\/revisions"}],"predecessor-version":[{"id":2811,"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/posts\/2417\/revisions\/2811"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/media\/2453"}],"wp:attachment":[{"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/media?parent=2417"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/categories?post=2417"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogit.itu.dk\/ethoslab\/wp-json\/wp\/v2\/tags?post=2417"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}