{"id":731,"date":"2015-11-23T03:18:15","date_gmt":"2015-11-23T02:18:15","guid":{"rendered":"https:\/\/joernhees.de\/blog\/?p=731"},"modified":"2016-09-28T22:42:30","modified_gmt":"2016-09-28T20:42:30","slug":"setting-up-a-linked-data-mirror-from-rdf-dumps-dbpedia-2015-04-freebase-wikidata-linkedgeodata-with-virtuoso-7-2-1-and-docker-optional","status":"publish","type":"post","link":"https:\/\/joernhees.de\/blog\/2015\/11\/23\/setting-up-a-linked-data-mirror-from-rdf-dumps-dbpedia-2015-04-freebase-wikidata-linkedgeodata-with-virtuoso-7-2-1-and-docker-optional\/","title":{"rendered":"Setting up a Linked Data mirror from RDF dumps (DBpedia 2015-04, Freebase, Wikidata, LinkedGeoData, &#8230;) with Virtuoso 7.2.1 and Docker (optional)"},"content":{"rendered":"<p>So you&#8217;re the guy who is allowed to setup a local DBpedia mirror or more generally a local Linked Data mirror for your work group? OK, today is your lucky day and you&#8217;re in the right place. I hope you&#8217;ll be able to benefit from my many hours of trials and errors. If anything goes wrong (or everything works fine), feel free to leave a comment below.<\/p>\n<h2>Versions of this guide<\/h2>\n<p>There are four older versions of this guide:<\/p>\n<ul>\n<li><a href=\"https:\/\/joernhees.de\/blog\/2010\/10\/31\/setting-up-a-local-dbpedia-mirror-with-virtuoso\">Oct. 2010<\/a>: The first version focusing on DBpedia 3.5 &#8211; 3.6 and Virtuoso 6.1<\/li>\n<li><a href=\"https:\/\/joernhees.de\/blog\/2012\/05\/25\/setting-up-a-local-dbpedia-3-7-mirror-with-virtuoso-6-1-5\/\">May 2012<\/a>: A bigger update to DBpedia 3.7 (new local language versions) and Virtuoso 6.1.5+ (with a lot of updates making pre-processing of the dumps easier)<\/li>\n<li><a href=\"https:\/\/joernhees.de\/blog\/2014\/04\/23\/setting-up-a-local-dbpedia-3-9-mirror-with-virtuoso-7\/\">Apr. 2014<\/a>: Update to DBpedia 3.9 and Virtuoso 7<\/li>\n<li><a href=\"https:\/\/joernhees.de\/blog\/2014\/11\/10\/setting-up-a-local-dbpedia-2014-mirror-with-virtuoso-7-1-0\/\">Nov. 2014<\/a>: Update to DBpedia 2014 and other Datasets and Virtuoso 7.1.0<\/li>\n<\/ul>\n<p>In this step by step guide I&#8217;ll tell you how to install a local Linked Data mirror of the <a href=\"http:\/\/wiki.dbpedia.org\/Downloads2015-04\" target=\"_blank\">DBpedia 2015-04<\/a>, hosting a combination of the regular English and (exemplary) the i18n German datasets adding up to nearly 850 M triples.<\/p>\n<p>I&#8217;ll also mention how you can add the following datasets \/ vocabularies adding up to nearly 6 G triples:<\/p>\n<ul>\n<li><a href=\"https:\/\/developers.google.com\/freebase\/data\" target=\"_blank\">Freebase<\/a> (~ 3 G triples)<\/li>\n<li><a href=\"http:\/\/linkedgeodata.org\/Datasets\" target=\"_blank\">LinkedGeoData<\/a> (~ 1 G triples)<\/li>\n<li><a href=\"http:\/\/tools.wmflabs.org\/wikidata-exports\/rdf\/index.php?content=exports.php\" target=\"_blank\">Wikidata<\/a> (~ 841 M triples)<\/li>\n<li><a href=\"http:\/\/dblp.l3s.de\/dblp++.php\" target=\"_blank\">DBLP<\/a> (~ 82 M triples)<\/li>\n<li><a href=\"https:\/\/www.mpi-inf.mpg.de\/de\/departments\/databases-and-information-systems\/research\/yago-naga\/yago\/downloads\/\" target=\"_blank\">Yago Labels<\/a> (~ 45 M triples)<\/li>\n<li><a href=\"https:\/\/github.com\/structureddynamics\/umbel\" target=\"_blank\">Umbel (~ 480 K triples)<\/a><\/li>\n<li><a href=\"http:\/\/schema.rdfs.org\/\" target=\"_blank\">Schema.org (~ 9 K triples)<\/a>\n<\/ul>\n<p>As DBpedia is quite modular and has many internationalized (i18n) versions it has its own <a href=\"#DBpedia_URIs_vs_IRIs\">section<\/a> in this guide, the other datasets don&#8217;t, as they maximally need minor repacking and a single line to load as explained below.<\/p>\n<h2>Used Versions<\/h2>\n<ul>\n<li>DBpedia 2015-04<\/li>\n<li>Virtuoso OpenSource 7.2.1<\/li>\n<li>Ubuntu 14.04 LTS or Debian 8<\/li>\n<\/ul>\n<h2>Prerequisites<\/h2>\n<p>A strong machine with root access and enough RAM: We used a VM with 4 Cores and 32 GBs of RAM for DBpedia only. If you intend to also load Freebase and other datasets i recommend at least 64 GBs of RAM (we actually ended up using a 16 Core, 256 GB RAM Server in our research group). For installing i recommend more than 128 GB free HD space for DBpedia alone, 512 GB if you want to load Freebase as well, especially for downloading and repacking the datasets, as well as the growing database file when importing (mine grew to 64 GBs for DBpedia and 320 GB with all the datasets mentioned above).<\/p>\n<p>This guide applies to a clean install. Please check that there&#8217;s no older version of Virtuoso installed with <code>dpkg -l | grep virtuoso ; which isql ; which isql-vt<\/code> (no output is good). If there is, please know what you&#8217;re doing. Virtuoso 6 and 7 use different default locations for their DBs, but in general newer versions should be able to upgrade older DB files if correctly configured to use the same DB file. In general i&#8217;d suggest to either uninstall the older version and its config files and then install the new one according to this guide or to isolate the newer one with the docker approach mentioned below.<\/p>\n<h2>For the impatient and docker affine<\/h2>\n<p>As an alternative to the following sections, which will explain how to build everything from source yourself and go into <a href=\"#DBpedia_URIs_vs_IRIs\">details about the DBpedia dump files<\/a>, i also provide a <a href=\"https:\/\/hub.docker.com\/r\/joernhees\/virtuoso\/\" target=\"_blank\">docker image<\/a> (<a href=\"https:\/\/github.com\/joernhees\/docker-virtuoso\" target=\"_blank\">source<\/a>) that you can use to automate and simplify the process a lot:<\/p>\n<pre><code class=\"bash\">dump_dir=~\/dumps\/dbpedia\/2015-04\r\ndb_dir=~\/virtuoso_db\r\nmkdir -p \"$dump_dir\"\r\ncd \"$dump_dir\"\r\n\r\n# downloading\r\nwget -r -nc -nH --cut-dirs=1 -np -l1 \\\r\n    -A '*.nt.bz2' -A '*.owl' -R '*unredirected*' \\\r\n    http:\/\/downloads.dbpedia.org\/2015-04\/core\/\r\n\r\n# repacking\r\napt-get install pigz pbzip2\r\nfor i in *\/*.nt.bz2 ; do echo $i ; pbzip2 -dc \"$i\" | pigz - &gt; \"${i%bz2}gz\" &amp;&amp; rm \"$i\"; done\r\nmkdir classes\r\ncd classes\r\nwget http:\/\/downloads.dbpedia.org\/2015-04\/dbpedia_2015-04.owl\r\ncd\r\n\r\n# install some VAD packages for DBpedia into our db which we'll keep in db_dir\r\ndocker run -d --name dbpedia-vadinst \\\r\n    -v \"$db_dir\":\/var\/lib\/virtuoso-opensource-7 \\\r\n    joernhees\/virtuoso run &amp;&amp;\r\ndocker exec dbpedia-vadinst wait_ready &amp;&amp;\r\ndocker exec dbpedia-vadinst isql-vt PROMPT=OFF VERBOSE=OFF BANNER=OFF \\\r\n    \"EXEC=vad_install('\/usr\/share\/virtuoso-opensource-7\/vad\/rdf_mappers_dav.vad');\" &amp;&amp;\r\ndocker exec dbpedia-vadinst isql-vt PROMPT=OFF VERBOSE=OFF BANNER=OFF \\\r\n    \"EXEC=vad_install('\/usr\/share\/virtuoso-opensource-7\/vad\/dbpedia_dav.vad');\" &amp;&amp;\r\ndocker stop dbpedia-vadinst &amp;&amp;\r\ndocker rm -v dbpedia-vadinst &amp;&amp;\r\n\r\n# starting the import\r\ndocker run --rm \\\r\n    -v \"$db_dir\":\/var\/lib\/virtuoso-opensource-7 \\\r\n    -v \"$dump_dir\"\/classes:\/import:ro \\\r\n    joernhees\/virtuoso import 'http:\/\/dbpedia.org\/resource\/classes#' &amp;&amp;\r\n# docker import of the actual data (will use 64 GB RAM and take about 1 hour)\r\ndocker run --rm \\\r\n    -v \"$db_dir\":\/var\/lib\/virtuoso-opensource-7 \\\r\n    -v \"$dump_dir\"\/core:\/import:ro \\\r\n    -e \"NumberOfBuffers=$((64*85000))\" \\\r\n    joernhees\/virtuoso import 'http:\/\/dbpedia.org' &amp;&amp;\r\n\r\n# running the local endpoint on port 8891 with 32 GB RAM:\r\ndocker run --name dbpedia \\\r\n    -v \"$db_dir\":\/var\/lib\/virtuoso-opensource-7 \\\r\n    -p 8891:8890 \\\r\n    -e \"NumberOfBuffers=$((32*85000))\" \\\r\n    joernhees\/virtuoso run\r\n\r\n# access one of the following for example:\r\n# http:\/\/localhost:8891\/sparql\r\n# http:\/\/localhost:8891\/resource\/Bonn\r\n# http:\/\/localhost:8891\/conductor (user: dba, pw: dba)\r\n<\/code><\/pre>\n<h2>The manual version<\/h2>\n<h3>Download and build Virtuoso<\/h3>\n<p>We&#8217;ll download Virtuoso OpenSource: either from <a href=\"http:\/\/sourceforge.net\/projects\/virtuoso\/\" target=\"_blank\">SourceForge<\/a> or <a href=\"https:\/\/github.com\/openlink\/virtuoso-opensource\/releases\" target=\"_blank\">GitHub<\/a> (make sure you get <a href=\"https:\/\/github.com\/openlink\/virtuoso-opensource\/releases\/download\/v7.2.1\/virtuoso-opensource-7.2.1.tar.gz\">v7.2.1<\/a> as in this guide or a newer version).<\/p>\n<p>Unlike in earlier versions of this guide we&#8217;ll now first build the .deb packages and then install them with apt-get.<\/p>\n<p>As building will install a lot of extra packages that you only need for building, i prepared another <a href=\"https:\/\/hub.docker.com\/r\/joernhees\/dpkg_build\/\" target=\"_blank\">docker image<\/a> (<a href=\"https:\/\/github.com\/joernhees\/docker-dpkg-build\" target=\"_blank\">source<\/a>) that will do the whole building job inside a container for you and put the resulting .deb packages (and DBpedia VAD) into your <code>~\/virtuoso_deb<\/code> folder:<\/p>\n<pre><code class=\"bash\">docker run --rm -it -v ~\/virtuoso_deb:\/export\/ joernhees\/dpkg_build \\\r\n    https:\/\/github.com\/openlink\/virtuoso-opensource\/releases\/download\/v7.2.1\/virtuoso-opensource-7.2.1.tar.gz \\\r\n    -j5\r\n# this should run for about 15 minutes\r\n# compilation by default sadly does not create the dbpedia VAD package, so\r\n# to do that, the above command stops after compilation in interactive mode.\r\n# in there just execute this:\r\ncd \/tmp\/build\/virtuoso*\/ &amp;&amp;\r\n.\/configure --with-layout=debian --enable-dbpedia-vad &amp;&amp;\r\ncd binsrc &amp;&amp;\r\nmake &amp;&amp;\r\ncp dbpedia\/dbpedia_dav.vad \/export &amp;&amp;\r\nexit\r\n<\/code><\/pre>\n<p>If you used this, you can skip the following down to installing the .deb packages.<\/p>\n<p>If not, to do the building manually run this to download the file, put it in your home dir on the server, then extract it and switch to the directory:<\/p>\n<pre><code class=\"bash\">mkdir ~\/virtuoso_deb\r\ncd ~\/virtuoso_deb\r\nwget https:\/\/github.com\/openlink\/virtuoso-opensource\/releases\/download\/v7.2.1\/virtuoso-opensource-7.2.1.tar.gz\r\ntar -xvzf virtuoso-7.2.1.tar.gz\r\ncd virtuoso-opensource-7.2.1  # or newer, depending what you got\r\n<\/code><\/pre>\n<p>Afterwards you can use the following to install the build dependencies and actually build the .deb packages:<\/p>\n<pre><code class=\"bash\"># install build tools\r\nsudo apt-get install -y build-essential devscripts\r\n# to install Virtuoso build dependencies\r\nmk-build-deps -irt'apt-get --no-install-recommends -yV' &amp;&amp; dpkg-checkbuilddeps\r\n# to build Virtuoso with 5 processes in parallel\r\n# choose something like your server's #CPUs + 1\r\ndpkg-buildpackage -us -uc -5\r\n<\/code><\/pre>\n<p>This will take about <strong>15 min<\/strong>.<br \/>\nAfterwards if everything worked out, you should have the *.deb files in <code>~\/virtuoso_deb<\/code>.<\/p>\n<p>We continue to also build the DBpedia VAD:<\/p>\n<pre><code class=\"bash\">.\/configure --with-layout=debian --enable-dbpedia-vad &amp;&amp; \\\r\ncd binsrc &amp;&amp; make \\\r\ncp dbpedia\/dbpedia_dav.vad ~\/virtuoso_deb\/\r\n<\/code><\/pre>\n<p>Finally, let&#8217;s create a small local repository out of the .deb files you just built. The advantage of this is that you can simply install virtuoso-server with its dependencies with apt. In theory you could also resolve them manually and install everything with <code>dpkg -i ...<\/code>, but where&#8217;s the fun in that?<\/p>\n<pre><code class=\"bash\">cd ~\/virtuoso_deb\r\ndpkg-scanpackages .\/ | gzip &gt; Packages.gz\r\n<\/code><\/pre>\n<h3>Installing Virtuoso<\/h3>\n<p>No matter if you used the docker or manual building approach for the .deb packages of Virtuoso, you should now be able to install them with <code>apt-get install ...<\/code> after telling it where to look for the files for example by doing this:<\/p>\n<pre><code class=\"bash\">sudo echo \"deb file:~\/virtuoso_deb .\/\" &gt;&gt; \/etc\/apt\/sources.list.d\/virtuoso_local_packages.list\r\nsudo apt-get update\r\n<\/code><\/pre>\n<p>After this just install Virtuoso with the following command (it should warn you about untrusted sources of the Virtuoso packages, which is because we just built them ourselves):<\/p>\n<pre><code class=\"bash\">sudo apt-get install virtuoso-server \\\r\n    virtuoso-vad-bpel \\\r\n    virtuoso-vad-conductor \\\r\n    virtuoso-vad-demo \\\r\n    virtuoso-vad-doc \\\r\n    virtuoso-vad-isparql \\\r\n    virtuoso-vad-ods \\\r\n    virtuoso-vad-rdfmappers \\\r\n    virtuoso-vad-sparqldemo \\\r\n    virtuoso-vad-syncml \\\r\n    virtuoso-vad-tutorial\r\n<\/code><\/pre>\n<p>The above will ask you for a DBA password. Please pick one.<\/p>\n<p>Installing the VAD packages here will actually not install them in the Virtuoso DB file, but just move them in the right place so they can for example be installed as mentioned <a href=\"#finalizing\">later<\/a>.<\/p>\n<p>To also move the DBpedia VAD in place for later you can just run this:<\/p>\n<pre><code class=\"bash\">sudo cp ~\/virtuoso_deb\/dbpedia_dav.vad \/usr\/share\/virtuoso-opensource-7\/vad\/\r\n<\/code><\/pre>\n<h3>Configuring Virtuoso<\/h3>\n<p>Now change the following values in <code>\/etc\/virtuoso-opensource-7\/virtuoso.ini<\/code>, the performance tuning stuff is according to <a href=\"http:\/\/virtuoso.openlinksw.com\/dataspace\/dav\/wiki\/Main\/VirtRDFPerformanceTuning\">http:\/\/virtuoso.openlinksw.com\/dataspace\/dav\/wiki\/Main\/VirtRDFPerformanceTuning<\/a>:<\/p>\n<pre><code class=\"bash\"># note: Virtuoso ignores lines starting with whitespace and stuff after a ;\r\n[Parameters]\r\n# you need to include the directory where your datasets will be downloaded\r\n# to, in our case \/usr\/local\/data\/datasets:\r\nDirsAllowed = ., \/usr\/share\/virtuoso\/vad, \/usr\/local\/data\/datasets\r\n# IMPORTANT: for performance also do this\r\n[Parameters]\r\n# the following two are as suggested by comments in the original .ini\r\n# file in order to use the RAM on your server:\r\nNumberOfBuffers = 2720000\r\nMaxDirtyBuffers = 2000000\r\n# each buffer caches a 8K page of data and occupies approx. 8700 bytes of\r\n# memory. It's suggested to set this value to 65 % of ram for a db only server\r\n# so if you have 32 GB of ram: 32*1000^3*0.65\/8700 = 2390804\r\n# default is 2000 which will use 16 MB ram ;)\r\n# Make sure to remove whitespace if you uncomment existing lines!\r\n[Database]\r\nMaxCheckpointRemap = 625000\r\n# set this to 1\/4th of NumberOfBuffers\r\n[SPARQL]\r\n# I like to increase the ResultSetMaxrows, MaxQueryCostEstimationTime\r\n# and MaxQueryExecutionTime drastically as it's a local store where we\r\n# do quite complex queries... up to you (don't do this if a lot of people\r\n# use it).\r\n# In any case for the importer to be more robust add the following setting\r\n# to this section:\r\nShortenLongURIs = 1\r\n<\/code><\/pre>\n<p>Afterwards restart Virtuoso:<\/p>\n<pre><code class=\"bash\">sudo \/etc\/init.d\/virtuoso-opensource-7 stop\r\n<\/code><\/pre>\n<p>You should now have a running Virtuoso server.<\/p>\n<h3 id=\"DBpedia_URIs_vs_IRIs\">DBpedia URIs (en) vs. DBpedia IRIs (i18n)<\/h3>\n<p>The DBpedia 2015-04 consists of several datasets: one &#8220;standard&#8221; English version and several localized versions for other languages (i18n). The standard version mints URIs by going through all English Wikipedia articles. For all of these the Wikipedia cross-language links are used to extract corresponding labels in other languages for the en URIs (e.g., <code>core\/labels-en-uris_de.nt.bz2<\/code>). This is problematic as for example articles which are only in the German Wikipedia won&#8217;t be extracted. To solve this problem the i18n versions exists and create IRIs in the form of de.dbpedia.org for every article in the German Wikipedia (e.g., <code>core-i18n\/de\/labels_de.nt.bz2<\/code>).<\/p>\n<p>This approach has several implications. For backwards compatibility reasons the standard DBpedia makes statements about URIs such as <a href=\"http:\/\/dbpedia.org\/resource\/Gerhard_Schr%C3%B6der\">http:\/\/dbpedia.org\/resource\/Gerhard_Schr%C3%B6der<\/a> while the local chapters, like the German one, make statements about IRIs such as <a href=\"http:\/\/de.dbpedia.org\/resource\/Gerhard_Schr\u00f6der\">http:\/\/de.dbpedia.org\/resource\/Gerhard_Schr\u00f6der<\/a> (note the \u00f6). In other words and as written above: the standard DBpedia uses URIs to identify things, while the localized versions use IRIs. This also means that <a href=\"http:\/\/dbpedia.org\/resource\/Gerhard_Schr\u00f6der\">http:\/\/dbpedia.org\/resource\/Gerhard_Schr\u00f6der<\/a> shouldn&#8217;t work. That said, clicking the link will actually work as there is magic going on in your browser to give you what you probably meant. Using curl <code>curl -i -L -H \"Accept: application\/rdf+xml\" http:\/\/dbpedia.org\/resource\/Gerhard_Schr\u00f6der<\/code> or SPARQLing the endpoint will nevertheless not be so nice\/sloppy and can cause quite some headache. Observe how the following two SPARQL queries return different results: <a href=\"http:\/\/dbpedia.org\/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&amp;qtxt=select+*+where+{+dbpedia%3AGerhard_Schr\u00f6der+%3Fp+%3Fo.+}&amp;format=text%2Fhtml&amp;timeout=30000&amp;debug=on\">select * where { dbpedia:Gerhard_Schr\u00f6der ?p ?o. }<\/a> vs. <a href=\"http:\/\/dbpedia.org\/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&amp;qtxt=select+*+where+{+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FGerhard_Schr%25C3%25B6der%3E+%3Fp+%3Fo.+}&amp;format=text%2Fhtml&amp;timeout=30000&amp;debug=on\">select * where { &lt;http:\/\/dbpedia.org\/resource\/Gerhard_Schr%C3%B6der&gt; ?p ?o. }<\/a>. In order to mitigate this historic problem a bit DBpedia actually offers owl:sameAs links from IRIs to URIs: <a href=\"http:\/\/wiki.dbpedia.org\/Downloads2015-04#p14506-35\">core\/iri-same-as-uri_en.nt.bz2<\/a> which you should load, so you at least have a link to what you want if someone tries to get info about an IRI.<\/p>\n<p>As the standard DBpedia provides labels, abstracts and a couple other things in several languages, there are two types of files in the localized DBpedia folders: There are triples directly associating the English URIs with for example the German labels (<code>{core,core-i18n\/de}\/labels-en-uris_de.nt.bz2<\/code>) and there are the localized triple files which associate for example the DE IRIs with the German labels (<code>core-i18n\/de\/labels_de.nt.bz2<\/code>).<\/p>\n<h3>Downloading the DBpedia dump files, de-duplication &amp; Repacking<\/h3>\n<p>For our group we decided that we wanted a reasonably complete mirror of the standard DBpedia (EN) (have a look at the <a href=\"http:\/\/downloads.dbpedia.org\/2015-04\/core\/\" target=\"_blank\">core<\/a> directory, which contains all datasets loaded into the public DBpedia SPARQL Endpoint), but also the <a href=\"http:\/\/downloads.dbpedia.org\/2015-04\/core-i18n\/\" target=\"_blank\">i18n versions<\/a> for the German DBpedia loaded in separate graphs, as well as each of their pagelink datasets in yet another separate graph each. For this we download the corresponding files in (NT) format as follows. If you need something different <a href=\"http:\/\/downloads.dbpedia.org\/2015-04\/\" target=\"_blank\">do so<\/a> (and maybe report back if there were problems and how you solved them).<\/p>\n<pre><code class=\"bash\"># see comment above, you could also get another DBpedia version...\r\nmkdir -p \/usr\/local\/data\/datasets\/dbpedia\/2015-04\r\ncd \/usr\/local\/data\/datasets\/dbpedia\/2015-04\r\nwget -r -nc -nH --cut-dirs=1 -np -l1 -A '*.nt.bz2' -A '*.owl' -R '*unredirected*' http:\/\/downloads.dbpedia.org\/2015-04\/{core\/,core-i18n\/en,core-i18n\/de,dbpedia_2015-04.owl}\r\n<\/code><\/pre>\n<p>As already mentioned, the DBpedia 2015-04 introduced a <code>core<\/code> folder which contains all files loaded on the public DBpedia endpoint. Be aware that if you download other folders like above you&#8217;ll be downloading some files twice in other folders (e.g., <code>labels-en-uris_de.nt.bz2<\/code> can be found in both, the <code>core<\/code> folder and the <code>core-i18n\/de<\/code> folder). Quite obvious, but especially the <code>core-i18n\/en<\/code> folder contains very many duplicate files from <code>core<\/code>. If want to see which downloaded files are duplicates (independent of their name) and especially which <code>core-i18n\/en<\/code> files were not loaded on the public endpoint, so are not in <code>core<\/code>, you can do the following:<\/p>\n<pre><code class=\"bash\"># compute md5 hashes for all downloaded files\r\nfind . -mindepth 2 -type f -print0 | xargs -0 md5sum &gt; md5sums\r\n\r\n# first check if there are duplicates in other folders without core\r\nLC_ALL=C sort md5sums | grep -v '\/core\/' | uniq -w32 -D\r\nba3fc042b14cb41e6c4282a6f7c45e02  .\/core-i18n\/en\/instance-types-dbtax-dbo_en.nt.bz2\r\nba3fc042b14cb41e6c4282a6f7c45e02  .\/core-i18n\/en\/instance_types_dbtax-dbo.nt.bz2\r\n<\/code><\/pre>\n<p>So it seems the <code>.\/core-i18n\/en\/instance-types-dbtax-dbo_en.nt.bz2<\/code> and <code>.\/core-i18n\/en\/instance_types_dbtax-dbo.nt.bz2<\/code> files are actually the same.<\/p>\n<p>To list all the files in <code>core-i18n\/en<\/code> which are duplicates do this:<\/p>\n<pre><code class=\"bash\"># list all dup files in core-i18n\/en\r\nLC_ALL=C sort md5sums | uniq -w32 -D | grep '\/core-i18n\/en'\r\n068975f6dd60f29d13c8442b0dbe403d  .\/core-i18n\/en\/skos-categories_en.nt.bz2\r\n14a770f293524a5713f741a1a448bcfa  .\/core-i18n\/en\/short-abstracts_en.nt.bz2\r\n1904ad5bc4579fd7efe7f40673c32f79  .\/core-i18n\/en\/specific-mappingbased-properties_en.nt.bz2\r\n1958649209bc90944c65eccd30d37c6c  .\/core-i18n\/en\/infobox-property-definitions_en.nt.bz2\r\n2774d36ce14e0143ca4fa25ed212a598  .\/core-i18n\/en\/external-links_en.nt.bz2\r\n314162db2acb516a1ef5fcb3a2c7df2b  .\/core-i18n\/en\/geonames_links_en.nt.bz2\r\n3b42f351fc30f6b6b97d3f2a16ef6db3  .\/core-i18n\/en\/instance-types-transitive_en.nt.bz2\r\n3b61b11bdcb50a0d44ca8f4bd68f4762  .\/core-i18n\/en\/revision-ids_en.nt.bz2\r\n43a8b17859c50d37f4cab83573c2992e  .\/core-i18n\/en\/instance_types_sdtyped-dbo_en.nt.bz2\r\n4c847b2754294c555236d09485200435  .\/core-i18n\/en\/instance-types_en.nt.bz2\r\n63e2cde88e7bdefb6739c62aa234fc1e  .\/core-i18n\/en\/category-labels_en.nt.bz2\r\n64cbbac14769aadf560496b4d948d5e1  .\/core-i18n\/en\/interlanguage-links-chapters_en.nt.bz2\r\n75f2d135459c824feee1d427e4165a4f  .\/core-i18n\/en\/transitive-redirects_en.nt.bz2\r\n82fe80c3868a89d54fec26c919a4fa50  .\/core-i18n\/en\/revision-uris_en.nt.bz2\r\n8407c84d262b573418326bdd8f591b95  .\/core-i18n\/en\/mappingbased-properties_en.nt.bz2\r\n87df057913a05dbb5666f360d20fa542  .\/core-i18n\/en\/freebase-links_en.nt.bz2\r\n8cc921fbab5d02ad83b1fda2f87c23f0  .\/core-i18n\/en\/wikipedia-links_en.nt.bz2\r\n9152e34db96df2dd4991e78b7e53ff3f  .\/core-i18n\/en\/article-categories_en.nt.bz2\r\n94b48e9df78f746e60a9d0c1aafa3241  .\/core-i18n\/en\/infobox-properties_en.nt.bz2\r\na254ce4596d045cc047959831edd318a  .\/core-i18n\/en\/disambiguations_en.nt.bz2\r\nab29899e43fab1c6f060cdb8955c5b19  .\/core-i18n\/en\/images_en.nt.bz2\r\nae046e03be0cf29eac1e3b8a8b3d6b03  .\/core-i18n\/en\/persondata_en.nt.bz2\r\nb4710d36b8dc915f07f5cec2d9971a27  .\/core-i18n\/en\/page-ids_en.nt.bz2\r\nba3fc042b14cb41e6c4282a6f7c45e02  .\/core-i18n\/en\/instance-types-dbtax-dbo_en.nt.bz2\r\nba3fc042b14cb41e6c4282a6f7c45e02  .\/core-i18n\/en\/instance_types_dbtax-dbo.nt.bz2\r\nbd90ce4064a120794b5eb5a8d024a97d  .\/core-i18n\/en\/long-abstracts_en.nt.bz2\r\ne4c422d1d23c69eff3b9d7d7df3f2f80  .\/core-i18n\/en\/homepages_en.nt.bz2\r\neafc557cde69fd1cd8f78565c385ee16  .\/core-i18n\/en\/iri-same-as-uri_en.nt.bz2\r\nef48deae48c9c9c5e17585e3f0243663  .\/core-i18n\/en\/labels_en.nt.bz2\r\nfa8800165c7e80509a4ebddc5f0caf90  .\/core-i18n\/en\/geo-coordinates_en.nt.bz2\r\n\r\n# to delete the duplicates from \/core-i18n\/en, leaving just one of each:\r\nLC_ALL=C sort md5sums | uniq -w32 -D | grep '\/core-i18n\/en' | uniq -w32 | cut -d' ' -f3 | xargs rm\r\n\r\n# afterwards these should be left:\r\nls -1 core-i18n\/en\r\ncore-i18n\/en\/anchor-text_en.nt.bz2\r\ncore-i18n\/en\/article-templates_en.nt.bz2\r\ncore-i18n\/en\/flickr-wrappr-links_en.nt.bz2\r\ncore-i18n\/en\/genders_en.nt.bz2\r\ncore-i18n\/en\/instance_types_dbtax-dbo.nt.bz2\r\ncore-i18n\/en\/instance_types_dbtax_ext.nt.bz2\r\ncore-i18n\/en\/instance_types_lhd_dbo_en.nt.bz2\r\ncore-i18n\/en\/instance_types_lhd_ext_en.nt.bz2\r\ncore-i18n\/en\/interlanguage-links_en.nt.bz2\r\ncore-i18n\/en\/out-degree_en.nt.bz2\r\ncore-i18n\/en\/page-length_en.nt.bz2\r\ncore-i18n\/en\/page-links_en.nt.bz2\r\ncore-i18n\/en\/pnd_en.nt.bz2\r\ncore-i18n\/en\/redirects_en.nt.bz2\r\ncore-i18n\/en\/topical-concepts_en.nt.bz2\r\n<\/code><\/pre>\n<p>As Virtuoso can only import plain (uncompressed) or gzipped files, but the DBpedia dumps are bzipped, you can either repack them into gzip format or extract them. On our server the importing procedure was reasonably slower from extracted files than from gzipped ones (ignoring the vast amount of wasted disk space for the extracted files). <strong>File access becomes a bottleneck<\/strong> if you have a couple of cores idling. This is why I decided on <strong>repacking<\/strong> all the files from bz2 to gz. As you can see I do the repacking with the parallel versions of bz2 and gz. If that&#8217;s not suitable for you, feel free to change it. You might also want to change this if you want to do it in parallel to downloading. The repackaging process below took about <strong>30 minutes<\/strong> but was worth it in the end. The more CPUs you have, the more you can parallelize this process.<\/p>\n<pre><code class=\"bash\"># if you want to save space do this:\r\napt-get install pigz pbzip2\r\nfor i in core\/*.nt.bz2 core-i18n\/*\/*.nt.bz2 ; do echo $i ; pbzip2 -dc \"$i\" | pigz - &gt; \"${i%bz2}gz\" &amp;&amp; rm \"$i\" ; done\r\n\r\n# else do:\r\n#pbzip2 *\/*.bz2\r\n\r\n# notice that the extraction (and repacking) of *.bz2 takes quite a while (about 30 minutes)\r\n# gzipped data is reasonably packed, but still very fast to access (in contrast to bz2), so maybe this is the best choice.\r\n<\/code><\/pre>\n<h3>Data Cleaning and The bulk loader scripts<\/h3>\n<p>In contrast to the previous versions of this article the Virtuoso import will take care of shortening too long IRIs itself. Also it seems the bulk loader script is included in the more recent Virtuoso versions, so as a reference only: see the <a href=\"http:\/\/joernhees.de\/blog\/2010\/10\/31\/setting-up-a-local-dbpedia-mirror-with-virtuoso\">old version<\/a> for the cleaning script and <a href=\"http:\/\/virtuoso.openlinksw.com\/dataspace\/dav\/wiki\/Main\/VirtBulkRDFLoaderExampleDbpedia\">VirtBulkRDFLoaderExampleDbpedia<\/a> and <a href=\"http:\/\/virtuoso.openlinksw.com\/dataspace\/dav\/wiki\/Main\/VirtBulkRDFLoaderScript\"><br \/>\nhttp:\/\/virtuoso.openlinksw.com\/dataspace\/dav\/wiki\/Main\/VirtBulkRDFLoaderScript<\/a> for info about the bulk loader scripts.<\/p>\n<h3>Importing DBpedia dumps into Virtuoso<\/h3>\n<p>Now AFTER the re-\/unpacking of the DBpedia dumps we will register all files in the DBpedia dir (recursively <code>ld_dir_all<\/code>) to be added to the DBpedia graph. If you use this method make sure that only files reside in the given subtree that you really want to import.<br \/>\nAlso don&#8217;t forget to import the dbpedia_2015-04.owl file!<br \/>\nIf you only want one directory&#8217;s files to be added (non recursive) use <code>ld_dir('dir', '*.*', 'graph');<\/code>.<br \/>\nIf you manually want to add some files, use <code>ld_add('file', 'graph');<\/code>.<br \/>\nSee the <a href=\"http:\/\/virtuoso.openlinksw.com\/dataspace\/dav\/wiki\/Main\/VirtBulkRDFLoaderScript\">VirtBulkRDFLoaderScript<\/a> file for details.<\/p>\n<p><strong>Be warned<\/strong> that it might be a bad idea to import the normal and i18n dataset into the same graph if you didn&#8217;t select specific languages, as it might introduce a lot of duplicates that are hard to disentangle.<\/p>\n<p>In order to keep track (and easily reproduce) what was selected and imported into which graph, I actually link (<code>ln -s<\/code>) the repacked files into a directory structure beneath <code>\/usr\/local\/data\/datasets\/dbpedia\/2015-04\/importedGraphs\/<\/code> and import from there instead. To make sure you think about this, I use that path below, so it won&#8217;t work if you didn&#8217;t pay attention. If you really want to import all downloaded files, just import <code>\/usr\/local\/data\/datasets\/dbpedia\/2015-04\/<\/code>.<\/p>\n<p>Also be aware of the fact that if you load certain parts of dumps in different graphs (such as I did with the pagelinks, as well as the i18n versions of the DE and FR datasets) that only triples from the http:\/\/dbpedia.org graph will be shown when you visit the local pages with your browser (SPARQL is unaffected by this)!<\/em><\/p>\n<p>So if you only want to load <a href=\"http:\/\/wiki.dbpedia.org\/Downloads2015-04\">the same datasets as loaded on the official endpoint<\/a> then importing the <code>core<\/code> folder (first section below) and <code>dbpedia_2015-04.owl<\/code> file should be enough.<\/p>\n<p>The following will prepare the linking for the datasets we loaded:<\/p>\n<pre><code class=\"bash\">cd \/usr\/local\/data\/datasets\/dbpedia\/2015-04\/\r\nmkdir importedGraphs\r\ncd importedGraphs\r\n\r\nmkdir dbpedia.org\r\ncd dbpedia.org\r\n# ln -s ..\/..\/dbpedia*.owl .\/  # see below!\r\nln -s ..\/..\/core\/*.nt.gz .\/\r\ncd ..\r\n\r\nmkdir ext.dbpedia.org\r\ncd ext.dbpedia.org\r\nln -s ..\/..\/core-i18n\/en\/anchor-text_en.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/article-templates_en.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/genders_en.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/instance_types_dbtax-dbo.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/instance_types_dbtax_ext.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/instance_types_lhd_dbo_en.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/instance_types_lhd_ext_en.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/out-degree_en.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/en\/page-length_en.nt.gz .\/\r\ncd ..\r\n\r\nmkdir pagelinks.dbpedia.org\r\ncd pagelinks.dbpedia.org\r\nln -s ..\/..\/core-i18n\/en\/page-links_en.nt.gz .\/\r\ncd ..\r\n\r\nmkdir topicalconcepts.dbpedia.org\r\ncd topicalconcepts.dbpedia.org\r\nln -s ..\/..\/core-i18n\/en\/topical-concepts_en.nt.gz .\/\r\ncd ..\r\n\r\n\r\nmkdir de.dbpedia.org\r\ncd de.dbpedia.org\r\nln -s ..\/..\/core-i18n\/de\/article-categories_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/article-templates_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/category-labels_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/disambiguations_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/external-links_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/freebase-links_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/geo-coordinates_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/geonames_links_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/homepages_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/images_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/infobox-properties_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/infobox-property-definitions_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/instance-types_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/instance_types_lhd_dbo_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/instance_types_lhd_ext_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/instance-types-transitive_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/interlanguage-links-chapters_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/interlanguage-links_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/iri-same-as-uri_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/labels_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/long-abstracts_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/mappingbased-properties_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/out-degree_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/page-ids_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/page-length_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/persondata_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/pnd_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/revision-ids_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/revision-uris_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/short-abstracts_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/skos-categories_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/specific-mappingbased-properties_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/transitive-redirects_de.nt.gz .\/\r\nln -s ..\/..\/core-i18n\/de\/wikipedia-links_de.nt.gz .\/\r\ncd ..\r\n\r\nmkdir pagelinks.de.dbpedia.org\r\ncd pagelinks.de.dbpedia.org\r\nln -s ..\/..\/core-i18n\/de\/page-links_de.nt.gz .\/\r\ncd ..\r\n<\/code><\/pre>\n<p>This should have prepared your <code>importedGraphs<\/code> directory. From this directory you can run the following command which prints out the necessary <code>isql-vt<\/code> commands to register your graphs for importing:<\/p>\n<pre><code class=\"bash\">for g in * ; do echo \"ld_dir_all('$(pwd)\/$g', '*.*', 'http:\/\/$g');\" ; done\r\n<\/code><\/pre>\n<p>One more thing (thanks to Romain): In order for the DBpedia.vad package (which is installed at the end) to work correctly, the <code>dbpedia_2014.owl<\/code> file needs to be imported into graph <code>http:\/\/dbpedia.org\/resource\/classes#<\/code>.<\/p>\n<p><em>Note: In the following i will assume that your Virtuoso isql command is called <code>isql-vt<\/code>. If you&#8217;re in lack of such a command, it might be called <code>isql<\/code> or <code>isql-v<\/code>, but this usually means you installed it using some other method than described in here<\/em><\/p>\n<pre><code class=\"bash\">isql-vt # enter Virtuoso isql mode\r\n<\/code><\/pre>\n<pre><code class=\"sql\">-- we are in sql mode now\r\nld_add('\/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/dbpedia_2015-04.owl', 'http:\/\/dbpedia.org\/resource\/classes#');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org', '*.*', 'http:\/\/dbpedia.org');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org', '*.*', 'http:\/\/de.dbpedia.org');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org', '*.*', 'http:\/\/ext.dbpedia.org');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/pagelinks.dbpedia.org', '*.*', 'http:\/\/pagelinks.dbpedia.org');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/pagelinks.de.dbpedia.org', '*.*', 'http:\/\/pagelinks.de.dbpedia.org');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/topicalconcepts.dbpedia.org', '*.*', 'http:\/\/topicalconcepts.dbpedia.org');\r\n\r\n-- do the following to see which files were registered to be added:\r\nselect * from DB.DBA.LOAD_LIST;\r\n-- if unsatisfied use:\r\n-- delete from DB.DBA.LOAD_LIST;\r\nEXIT;\r\n<\/code><\/pre>\n<p>You can now also register other datasets like <a href=\"https:\/\/developers.google.com\/freebase\/data\" target=\"_blank\">Freebase<\/a>, <a href=\"http:\/\/dblp.l3s.de\/dblp++.php\" target=\"_blank\">DBLP<\/a>, <a href=\"http:\/\/www.mpi-inf.mpg.de\/de\/departments\/databases-and-information-systems\/research\/yago-naga\/yago\/downloads\/\" target=\"_blank\">Yago<\/a>, <a href=\"https:\/\/github.com\/structureddynamics\/umbel\" target=\"_blank\">Umbel<\/a> and <a href=\"http:\/\/schema.rdfs.org\/\" target=\"_blank\">Schema.org<\/a> &#8230; that you want to be loaded after downloading them to the appropriate directories like this:<\/p>\n<pre><code class=\"bash\">ld_add('\/usr\/local\/data\/datasets\/remote\/schema.org\/2015-11-04\/all.nt', 'http:\/\/schema.org');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/umbel\/External Ontologies', '*.n3', 'http:\/\/umbel.org\/umbel\/rc');\r\nld_add('\/usr\/local\/data\/datasets\/remote\/umbel\/Ontology\/umbel.n3', 'http:\/\/umbel.org\/umbel');\r\nld_add('\/usr\/local\/data\/datasets\/remote\/umbel\/Reference Structure\/umbel_reference_concepts.n3', 'http:\/\/umbel.org\/umbel\/rc');\r\nld_add('\/usr\/local\/data\/datasets\/remote\/yago\/yago3\/2015-11-04\/yagoLabels.ttl.gz', 'http:\/\/yago-knowledge.org\/resource');\r\n\r\nld_add('\/usr\/local\/data\/datasets\/remote\/dblp\/l3s\/2015-11-04\/dblp.nt.gz', 'http:\/\/dblp.l3s.de');\r\n\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026', '*.nt.gz', 'http:\/\/www.wikidata.org');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/freebase\/2015-08-09', '*.nt.gz', 'http:\/\/rdf.freebase.com');\r\nld_dir_all('\/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09', '*.*', 'http:\/\/linkedgeodata.org');\r\n<\/code><\/pre>\n<p>Our full <code>DB.DBA.LOAD_LIST<\/code> currently looks like this:<\/p>\n<pre><code class=\"sql\">select ll_graph, ll_file from DB.DBA.LOAD_LIST;\r\n<\/code><\/pre>\n<pre><code>ll_graph                               ll_file\r\nVARCHAR                                VARCHAR NOT NULL\r\n____________________________________\r\n\r\nhttp:\/\/dblp.l3s.de                     \/usr\/local\/data\/datasets\/remote\/dblp\/l3s\/2015-11-04\/dblp.nt.gz\r\nhttp:\/\/dbpedia.org\/resource\/classes#   \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/dbpedia_2015-04.owl\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/amsterdammuseum_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/article-categories_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/bbcwildlife_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/bookmashup_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/bricklink_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/category-labels_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/cordis_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/dailymed_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/dblp_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/dbpedia_2015-04.owl\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/dbtune_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/disambiguations_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/diseasome_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/drugbank_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/eunis_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/eurostat_linkedstatistics_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/eurostat_wbsg_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/external-links_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/factbook_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/flickrwrappr_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/freebase-links_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/gadm_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/geo-coordinates_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/geonames_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/geonames_links_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/geospecies_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/gho_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/gutenberg_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/homepages_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/images_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/infobox-properties_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/infobox-property-definitions_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/instance-types-transitive_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/instance-types_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/instance_types_sdtyped-dbo_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/interlanguage-links-chapters_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/iri-same-as-uri_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/italian_public_schools_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_ar.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_de.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_es.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_fr.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_it.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_ja.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_nl.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_pl.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_pt.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_ru.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels-en-uris_zh.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/labels_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/linkedgeodata_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/linkedmdb_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/lobid.org-manifestation.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/lobid.org-organization.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_ar.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_de.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_es.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_fr.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_it.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_ja.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_nl.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_pl.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_pt.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_ru.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts-en-uris_zh.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/long-abstracts_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/mappingbased-properties_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/musicbrainz_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/nuts_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/nytimes_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/opencyc_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/openei_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/page-ids_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/persondata_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/revision-ids_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/revision-uris_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/revyu_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_ar.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_de.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_es.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_fr.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_it.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_ja.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_nl.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_pl.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_pt.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_ru.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts-en-uris_zh.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/short-abstracts_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/sider_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/skos-categories_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/specific-mappingbased-properties_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/tcm_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/transitive-redirects_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/transparency_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/uk-university_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/umbel_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/uscensus_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/viaf_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/wikicompany_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/wikipedia-links_en.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/wordnet_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/yago_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/yago_taxonomy.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/yago_type_links.nt.gz\r\nhttp:\/\/dbpedia.org                     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/dbpedia.org\/yago_types.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/article-categories_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/article-templates_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/category-labels_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/disambiguations_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/external-links_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/freebase-links_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/geo-coordinates_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/geonames_links_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/homepages_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/images_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/infobox-properties_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/infobox-property-definitions_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/instance-types-transitive_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/instance-types_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/instance_types_lhd_dbo_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/instance_types_lhd_ext_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/interlanguage-links-chapters_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/interlanguage-links_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/iri-same-as-uri_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/labels_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/long-abstracts_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/mappingbased-properties_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/out-degree_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/page-ids_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/page-length_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/persondata_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/pnd_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/revision-ids_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/revision-uris_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/short-abstracts_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/skos-categories_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/specific-mappingbased-properties_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/transitive-redirects_de.nt.gz\r\nhttp:\/\/de.dbpedia.org                  \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/de.dbpedia.org\/wikipedia-links_de.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/anchor-text_en.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/article-templates_en.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/genders_en.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/instance_types_dbtax-dbo.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/instance_types_dbtax_ext.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/instance_types_lhd_dbo_en.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/instance_types_lhd_ext_en.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/out-degree_en.nt.gz\r\nhttp:\/\/ext.dbpedia.org                 \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/ext.dbpedia.org\/page-length_en.nt.gz\r\nhttp:\/\/pagelinks.dbpedia.org           \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/pagelinks.dbpedia.org\/page-links_en.nt.gz\r\nhttp:\/\/pagelinks.de.dbpedia.org        \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/pagelinks.de.dbpedia.org\/page-links_de.nt.gz\r\nhttp:\/\/topicalconcepts.dbpedia.org     \/usr\/local\/data\/datasets\/remote\/dbpedia\/2015-04\/importedGraphs\/topicalconcepts.dbpedia.org\/topical-concepts_en.nt.gz\r\nhttp:\/\/rdf.freebase.com                \/usr\/local\/data\/datasets\/remote\/freebase\/2015-08-09\/fb2w.nt.gz\r\nhttp:\/\/rdf.freebase.com                \/usr\/local\/data\/datasets\/remote\/freebase\/2015-08-09\/freebase-rdf-2015-08-09-00-01.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Abutters.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Abutters.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-AerialwayThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-AerialwayThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-AerowayThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-AerowayThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Amenity.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Amenity.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-BarrierThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-BarrierThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Boundary.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Boundary.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Craft.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Craft.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-CyclewayThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-CyclewayThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-EmergencyThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-EmergencyThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-HistoricThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-HistoricThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Leisure.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Leisure.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-LockThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-LockThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-ManMadeThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-ManMadeThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-MilitaryThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-MilitaryThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Office.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Office.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Place.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Place.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-PowerThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-PowerThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-PublicTransportThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-PublicTransportThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-RailwayThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-RailwayThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-RouteThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-RouteThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Shop.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-Shop.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-SportThing.node.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-SportThing.way.sorted.nt.gz\r\nhttp:\/\/linkedgeodata.org               \/usr\/local\/data\/datasets\/remote\/linkedgeodata\/2014-09-09\/2014-09-09-ontology.sorted.nt.gz\r\nhttp:\/\/schema.org                      \/usr\/local\/data\/datasets\/remote\/schema.org\/2015-11-04\/all.nt\r\nhttp:\/\/umbel.org\/umbel\/rc              \/usr\/local\/data\/datasets\/remote\/umbel\/External Ontologies\/dbpedia-ontology.n3\r\nhttp:\/\/umbel.org\/umbel\/rc              \/usr\/local\/data\/datasets\/remote\/umbel\/External Ontologies\/geonames.n3\r\nhttp:\/\/umbel.org\/umbel\/rc              \/usr\/local\/data\/datasets\/remote\/umbel\/External Ontologies\/opencyc.n3\r\nhttp:\/\/umbel.org\/umbel\/rc              \/usr\/local\/data\/datasets\/remote\/umbel\/External Ontologies\/same-as.n3\r\nhttp:\/\/umbel.org\/umbel\/rc              \/usr\/local\/data\/datasets\/remote\/umbel\/External Ontologies\/schema.org.n3\r\nhttp:\/\/umbel.org\/umbel\/rc              \/usr\/local\/data\/datasets\/remote\/umbel\/External Ontologies\/wikipedia.n3\r\nhttp:\/\/umbel.org\/umbel                 \/usr\/local\/data\/datasets\/remote\/umbel\/Ontology\/umbel.n3\r\nhttp:\/\/umbel.org\/umbel\/rc              \/usr\/local\/data\/datasets\/remote\/umbel\/Reference Structure\/umbel_reference_concepts.n3\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-instances.nt.gz\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-properties.nt.gz\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-property-taxonomy.nt.gz\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-simple-statements.nt.gz\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-sitelinks.nt.gz\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-statements.nt.gz\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-taxonomy.nt.gz\r\nhttp:\/\/www.wikidata.org                \/usr\/local\/data\/datasets\/remote\/wikidata\/tools.wmflabs.org\/wikidata-exports\/rdf\/exports\/20151026\/wikidata-terms.nt.gz\r\nhttp:\/\/yago-knowledge.org\/resource     \/usr\/local\/data\/datasets\/remote\/yago\/yago3\/2015-11-04\/yagoLabels.ttl.gz\r\n\r\n219 Rows. -- 8 msec.\r\n<\/code><\/pre>\n<p>OK, now comes the fun (and long part: about <strong>1.5 hours<\/strong> (new Virtuoso 7 is cool \ud83d\ude09 for DBpedia alone, +~6 hours for Freebase)&#8230; After we registered the files to be added, now let&#8217;s finally <strong>start the process<\/strong>. Fire up <code>screen<\/code> if you didn&#8217;t already. (For more detailed metering than below see <a href=\"http:\/\/virtuoso.openlinksw.com\/dataspace\/doc\/dav\/wiki\/Main\/VirtTipsAndTricksGuideLDMeterUtility\">VirtTipsAndTricksGuideLDMeterUtility<\/a>.)<\/p>\n<pre><code class=\"bash\">sudo apt-get install screen\r\nscreen isql-vt\r\n<\/code><\/pre>\n<pre><code class=\"sql\">rdf_loader_run();\r\n-- DO NOT USE THE DB BESIDES THE FOLLOWING COMMANDS:\r\n-- depending on the amount of CPUs and your IO performance you can run\r\n-- more rdf_loader_run(); commands in other isql-vt sessions which will\r\n-- speed up the import process.\r\n-- you can watch the progress from another isql-vt session with:\r\n-- select * from DB.DBA.LOAD_LIST;\r\n-- if you need to stop the loading for any reason: rdf_load_stop();\r\n-- if you want to force stopping: rdf_load_stop(1);\r\ncheckpoint;\r\ncommit work;\r\ncheckpoint;\r\nEXIT;\r\n<\/code><\/pre>\n<p>After this:<br \/>\nTake a look into <code>var\/lib\/virtuoso\/db\/virtuoso.log<\/code> and run this:<\/p>\n<pre><code class=\"bash\">isql-vt BANNER=OFF VERBOSE=OFF 'EXEC=SELECT * FROM DB.DBA.LOAD_LIST WHERE ll_error IS NOT NULL;'\r\n<\/code><\/pre>\n<p>Should you find any <strong>errors<\/strong> in there&#8230; FIX THEM! You might be able to use the dump, but it&#8217;s incomplete in those cases. Any error quits out of the loading of the corresponding file and continues with the next one, so you&#8217;re only using the part of that file up to the place where the error occurred. (Should you find errors you can&#8217;t fix, please leave a comment.)<\/p>\n<h3 id=\"finalizing\">Final polishing<\/h3>\n<p>You can &amp; should now install the <strong>DBpedia<\/strong> and <strong>RDF Mappers packages<\/strong> from the Virtuoso Conductor.<br \/>\n<code>http:\/\/your-server:8890<\/code><\/p>\n<pre><code>login: dba\r\npw: dba\r\n<\/code><\/pre>\n<p>Go to System Admin \/ Packages. Install the DBpedia (v. 1.4.30) and rdf_mappers (v. 1.34.74) packages (takes about 5 minutes).<\/p>\n<h3>Testing your local mirror<\/h3>\n<p>Go to the sparql-endpoint of your server <code>http:\/\/your-server:8890\/sparql<\/code> (or in isql-vt prefix with: SPARQL)<\/p>\n<pre><code class=\"sql\">sparql SELECT count(*) WHERE { ?s ?p ?o } ;\r\n<\/code><\/pre>\n<p>This shouldn&#8217;t take long in Virtuoso 7 anymore and for me now returns 849,521,186 for DBpedia (en+de) or 5,959,006,725 with all the datasets mentioned above.<\/p>\n<p>I also like this query showing all the graphs and how many triples are in them:<\/p>\n<pre><code class=\"sql\">sparql SELECT ?g COUNT(*) as ?c { GRAPH ?g {?s ?p ?o.} } GROUP BY ?g ORDER BY DESC(?c);\r\ng                                                            c\r\nLONG VARCHAR                                                 LONG VARCHAR\r\n__________________________________________________________\r\n\r\nhttp:\/\/rdf.freebase.com                                      3126890738\r\nhttp:\/\/linkedgeodata.org                                     1013866920\r\nhttp:\/\/www.wikidata.org                                      841008708\r\nhttp:\/\/dbpedia.org                                           411914840\r\nhttp:\/\/pagelinks.dbpedia.org                                 158878272\r\nhttp:\/\/de.dbpedia.org                                        119876594\r\nhttp:\/\/ext.dbpedia.org                                       99042212\r\nhttp:\/\/dblp.l3s.de                                           81987210\r\nhttp:\/\/pagelinks.de.dbpedia.org                              59622795\r\nhttp:\/\/yago-knowledge.org\/resource                           44963422\r\nhttp:\/\/umbel.org\/umbel\/rc                                    480616\r\nhttp:\/\/www.openlinksw.com\/schemas\/RDF_Mapper_Ontology\/1.0\/   256065\r\nhttp:\/\/topicalconcepts.dbpedia.org                           157560\r\nhttp:\/\/dbpedia.org\/resource\/classes#                         28880\r\nhttp:\/\/schema.org                                            8727\r\nhttp:\/\/localhost:8890\/DAV\/                                   4806\r\nhttp:\/\/www.openlinksw.com\/schemas\/virtrdf#                   2472\r\nhttp:\/\/umbel.org\/umbel                                       1584\r\nhttp:\/\/open.vocab.org\/terms                                  1480\r\nhttp:\/\/purl.org\/ontology\/bibo\/                               1226\r\nhttp:\/\/purl.org\/goodrelations\/v1                             937\r\nhttp:\/\/purl.org\/dc\/terms\/                                    857\r\nhttp:\/\/www.openlinksw.com\/schemas\/opengraph                  804\r\nhttp:\/\/www.openlinksw.com\/schemas\/linkedin                   741\r\nhttp:\/\/www.openlinksw.com\/schemas\/googleplus                 696\r\nhttp:\/\/www.openlinksw.com\/schemas\/google-base                691\r\nhttp:\/\/www.openlinksw.com\/schemas\/cv                         661\r\nvirtrdf-label                                                638\r\nhttp:\/\/xmlns.com\/foaf\/0.1\/                                   557\r\nhttp:\/\/rdfs.org\/sioc\/ns#                                     553\r\nhttp:\/\/www.openlinksw.com\/schemas\/evri                       482\r\nhttp:\/\/www.openlinksw.com\/schemas\/crunchbase                 444\r\nhttp:\/\/bblfish.net\/work\/atom-owl\/2006-06-06\/                 386\r\nhttp:\/\/scot-project.org\/scot\/ns#                             332\r\nhttp:\/\/www.openlinksw.com\/schemas\/zillow                     311\r\nhttp:\/\/www.w3.org\/2004\/02\/skos\/core                          252\r\nhttp:\/\/www.openlinksw.com\/schemas\/cnet                       225\r\nhttp:\/\/www.openlinksw.com\/schemas\/tesco                      183\r\nhttp:\/\/www.openlinksw.com\/schemas\/bestbuy                    172\r\nhttp:\/\/www.w3.org\/2002\/07\/owl#                               160\r\nhttp:\/\/www.w3.org\/2002\/07\/owl                                160\r\nhttp:\/\/www.openlinksw.com\/schemas\/angel#                     144\r\nhttp:\/\/www.openlinksw.com\/schemas\/amazon                     143\r\nhttp:\/\/purl.org\/dc\/elements\/1.1\/                             139\r\nhttp:\/\/www.w3.org\/2007\/05\/powder-s#                          117\r\nhttp:\/\/www.openlinksw.com\/schemas\/twitter                    103\r\nhttp:\/\/www.openlinksw.com\/schemas\/stackoverflow#             102\r\nhttp:\/\/www.openlinksw.com\/schemas\/klout                      90\r\nhttp:\/\/www.w3.org\/2000\/01\/rdf-schema#                        87\r\nhttp:\/\/www.w3.org\/1999\/02\/22-rdf-syntax-ns#                  85\r\nhttp:\/\/www.openlinksw.com\/schemas\/ebay                       79\r\nhttp:\/\/www.openlinksw.com\/schema\/attribution#                68\r\nhttp:\/\/www.openlinksw.com\/schemas\/nyt                        41\r\nhttp:\/\/www.openlinksw.com\/schemas\/wolframalpha#              32\r\nhttp:\/\/www.openlinksw.com\/schemas\/oplbase                    26\r\nhttp:\/\/www.openlinksw.com\/schemas\/cert#                      23\r\nhttp:\/\/www.openlinksw.com\/schemas\/dbpedia-spotlight#         21\r\nhttp:\/\/www.openlinksw.com\/schemas\/money                      21\r\nhttp:\/\/localhost:8890\/sparql                                 14\r\nhttp:\/\/dbpedia.org\/schema\/property_rules#                    12\r\ndbprdf-label                                                 6\r\nhttp:\/\/www.w3.org\/ns\/ldp#                                    3\r\n\r\n62 Rows. -- 58092 msec.\r\n<\/code><\/pre>\n<p>Congratulations, you just imported nearly 850 million triples (or nearly 6 G triples for all datasets).<\/p>\n<h3>Backing up this initial state<\/h3>\n<p>Now is a good moment to backup the whole db (takes about <strong>half an hour<\/strong>):<\/p>\n<pre><code class=\"bash\">sudo -i\r\ncd \/\r\n\/etc\/init.d\/virtuoso-opensource stop &amp;&amp;\r\ntar -cvf - \/var\/lib\/virtuoso | lzop &gt; virtuoso-7.1.0-DBDUMP-$(date '+%F')-dbpedia-2015-04-en_de.tar.lzop &amp;&amp;\r\n\/etc\/init.d\/virtuoso-opensource start\r\n<\/code><\/pre>\n<p>Afterwards you might want to repack this with xz (lzma) like this:<\/p>\n<pre><code class=\"bash\"># apt-get install xz pxz\r\nfor f in virtuoso-7.1.0-DBDUMP-*.tar.lzop ; do lzop -d -c \"$f\" | pxz &gt; \"${f%lzop}.xz\" ; done\r\n<\/code><\/pre>\n<p>Yay, done \ud83d\ude09<br \/>\nAs always, feel free to leave comments if i made a mistake or to tell us about your problems or how happy you are :D.<\/p>\n<h3>Thanks<\/h3>\n<p>Many thanks to the DBpedia team for their endless efforts of providing us all with a great dataset. Also many thanks to the Virtuoso crew for releasing an OpenSource version of their DB.<\/p>\n<h3>Updates<\/h3>\n<ul>\n<li>2015-12-07: added a check for older installed versions.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>So you&#8217;re the guy who is allowed to setup a local DBpedia mirror or more generally a local Linked Data mirror for your work group? OK, today is your lucky day and you&#8217;re in the right place. I hope you&#8217;ll be able to benefit from my many hours of trials and errors. If anything goes [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":""},"categories":[2,198],"tags":[33,34,216,213,69,88,89,214,106,134,170,171,187,215],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/pYA5n-bN","jetpack-related-posts":[{"id":610,"url":"https:\/\/joernhees.de\/blog\/2014\/11\/10\/setting-up-a-local-dbpedia-2014-mirror-with-virtuoso-7-1-0\/","url_meta":{"origin":731,"position":0},"title":"Setting up a local DBpedia 2014 mirror with Virtuoso 7.1.0","date":"2014-11-10","format":false,"excerpt":"Newer version available: Setting up a Linked Data mirror from RDF dumps (DBpedia 2015-04, Freebase, Wikidata, LinkedGeoData, ...) with Virtuso 7.2.1 and Docker (optional) So you're the guy who is allowed to setup a local DBpedia mirror or more generally a local Linked Data mirror for your work group? OK,\u2026","rel":"","context":"In &quot;Coding&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":584,"url":"https:\/\/joernhees.de\/blog\/2014\/04\/23\/setting-up-a-local-dbpedia-3-9-mirror-with-virtuoso-7\/","url_meta":{"origin":731,"position":1},"title":"Setting up a local DBpedia 3.9 mirror with Virtuoso 7","date":"2014-04-23","format":false,"excerpt":"Newer version available: Setting up a Linked Data mirror from RDF dumps (DBpedia 2015-04, Freebase, Wikidata, LinkedGeoData, ...) with Virtuso 7.2.1 and Docker (optional) I just found this aged post in my drafts folder, maybe someone will still like it... So you're the guy who is allowed to setup a\u2026","rel":"","context":"In &quot;Coding&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":277,"url":"https:\/\/joernhees.de\/blog\/2010\/10\/31\/setting-up-a-local-dbpedia-mirror-with-virtuoso\/","url_meta":{"origin":731,"position":2},"title":"Setting up a local DBpedia mirror with Virtuoso","date":"2010-10-31","format":false,"excerpt":"So you're the guy who is allowed to setup a local DBpedia mirror for your work group? OK, today is your lucky day and you're in the right place. I hope you'll be able to benefit from my hours of trials and errors ;)","rel":"","context":"In &quot;Coding&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":442,"url":"https:\/\/joernhees.de\/blog\/2012\/05\/25\/setting-up-a-local-dbpedia-3-7-mirror-with-virtuoso-6-1-5\/","url_meta":{"origin":731,"position":3},"title":"Setting up a local DBpedia 3.7 mirror with Virtuoso 6.1.5+","date":"2012-05-25","format":false,"excerpt":"Newer version available: Setting up a Linked Data mirror from RDF dumps (DBpedia 2015-04, Freebase, Wikidata, LinkedGeoData, ...) with Virtuso 7.2.1 and Docker (optional) Nearly 1.5 years after i initially published a post about how to setup a local DBpedia mirror i recently revisited the problem myself to setup a\u2026","rel":"","context":"In &quot;Coding&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":643,"url":"https:\/\/joernhees.de\/blog\/2015\/01\/28\/dbpedia-2014-stats-top-subjects-predicates-and-objects\/","url_meta":{"origin":731,"position":4},"title":"DBpedia 2014 Stats - Top Subjects, Predicates and Objects","date":"2015-01-28","format":false,"excerpt":"Ever wondered what the top subjects \/ predicates \/ objects are in DBpedia? I recently came across this problem while trying to draw a random sample of nodes from DBpedia which follow a given degree distribution for my PhD. Turns out this is actually more difficult than i expected. Mostly\u2026","rel":"","context":"In &quot;Coding&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":337,"url":"https:\/\/joernhees.de\/blog\/2011\/01\/13\/betterrelations-some-updates\/","url_meta":{"origin":731,"position":5},"title":"BetterRelations (beta): some updates","date":"2011-01-13","format":false,"excerpt":"Well, in a hopefully last coding \"flash\" this night I included some frequently requested features, most important: a \"can't decide\" button: Enjoy ;) (also see the first post)","rel":"","context":"In &quot;LODgames&quot;","img":{"alt_text":"","src":"https:\/\/i1.wp.com\/joernhees.de\/blog\/wp-content\/uploads\/2011\/01\/screenshot_betterRelations_inRoundBarack1.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/posts\/731"}],"collection":[{"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/comments?post=731"}],"version-history":[{"count":13,"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/posts\/731\/revisions"}],"predecessor-version":[{"id":742,"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/posts\/731\/revisions\/742"}],"wp:attachment":[{"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/media?parent=731"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/categories?post=731"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/joernhees.de\/blog\/wp-json\/wp\/v2\/tags?post=731"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}