Hi all,
one major thing, which we are still checking is to go for CC-BY instead
of CC-BY-SA . All properties/objects should be fine except for abstracts
or anything that contains long texts.
All the best,
Sebastian
On 28.04.2017 16:46, Markus Freudenberg wrote:
Dear all,
Another DBpedia release is dawning and soon will be published in full.
In the meantime, we will forward the main body of data for the coming
2016-10 release:
http://downloads.dbpedia.org/2016-1
<http://downloads.dbpedia.org/2016-04/>0/
This release-cycle took somewhat longer than the last ones for
multiple reasons:
1.
We started extracting the full texts of each wiki page in addition
to the abstracts in the NLP Interchange Format(NIF
<http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core/nif-core.html>),
providing the readable text structured in sections and paragraphs
and all in-text links (see here
<http://downloads.dbpedia.org/2016-04/ext/nif-abstracts/>: careful
the linked datasets on this page represents data of the last
release). The additional (nif-) datasets will double the released
data dumps in size.
2.
We are preparing a major overhaul of the data extraction procedure
based on SPARK <http://spark.apache.org>, in cooperation with the
Semantic Web Company <https://www.semantic-web.at>, which
necessitates extended refactoring of the current code-base.
3.
We focused on actively gathering incoming links of other datasets
to return the favour by turning them around as outgoing links.
This is an ongoing process, which will update the links on a
monthly basis.
<http://downloads.dbpedia.org/links/2017-04-01/dbpedia.org/>
Please have a closer look at the current status of the data, so we can
catch missing or odd data points before publishing the data.
What is still missing:
*
Additional types (SDTypes, Hypernyms, DBTax)
*
Additional datasets for DBpedia+
*
Release statistics
*
download page
*
No public endpoint yet with the data
In case you missed the changes in the last release (2016-04):
*
In addition to normalized datasets to English DBpedia (en-uris) we
additionally provide normalized datasets based on the DBpedia
Wikidata (DBw) datasets (wkd-uris). These sorted datasets will be
the foundation for the upcoming fusion process with wikidata. The
DBw-based uris will be the only ones provided from the following
releases on.
*
We now filter out triples from the Raw Infobox Extractor that are
already mapped. E.g. no more “<x> dbo:birthPlace <z>” and “<x>
dbp:birthPlace|dbp:placeOfBirth|... <z>” in the same resource.
These triples are now moved to the “infobox-properties-mapped”
datasets and not loaded on the main endpoint. Seeissue 22
<https://github.com/dbpedia/extraction-framework/issues/22>for
more details.
*
Major improvements in our citation extraction. Seehere
<http://www.mail-archive.com/[email protected]/msg07762.html>for
more details.
Markus, on behalf of the DBpedia extraction team.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
DBpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
--
All the best,
Sebastian Hellmann
Director of Knowledge Integration and Linked Data Technologies (KILT)
Competence Center
at the Institute for Applied Informatics (InfAI) at Leipzig University
Executive Director of the DBpedia Association
Projects: http://dbpedia.org, http://nlp2rdf.org,
http://linguistics.okfn.org, https://www.w3.org/community/ld4lt
<http://www.w3.org/community/ld4lt>
Homepage: http://aksw.org/SebastianHellmann
Research Group: http://aksw.org
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
DBpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion