Strings in Sparql - sparql

I'm playing around with DBPedia.
With this query I get all people who were born in London:
SELECT ?person
WHERE {
?person dbo:birthPlace :London
}
But why I get an empty result when I execute this query?
SELECT ?person
WHERE {
?person dbo:birthPlace "London"
}
I just changed London to a String.

This is because the object of this relation is an entity, and not a string, hence the absence of result with the second query.
To know if a property (i.e dbo:birthPlace) relates an entity to a literal or not, one approach is to have a look at the "About" page of the property, for example, birthPlace's one.
What can be seen there is that the type of birthPlace is owl:ObjectProperty, meaning that the object of the relation will have to be an entity, defined with a URI.
The other possibility would be DatatypeProperty, as for the "abstract" property for example, where the object of the relation will be a literal.
The fact that the birth place is an entity allows a lot of things, such as retrieving specific information about that place in the same query, for example.
Hope that helps !

Related

Do all ontologies that import 'owl' or 'rdf', implement 'domain', 'range' and other related predicates?

Sorry if this is a noob's and simple question, but it will help me resolve a conceptual confusion of mine! I have some guesses, but want to make sure.
I got the location of a part of brain via NeuroFMA ontology and the query below:
PREFIX fma: <http://sig.uw.edu/fma#>
select ?loc{
fma:Superior_temporal_gyrus fma:location ?loc}
The result was: fma:live_incus_fm_14056
I thought I might be able to get some more information on this item.
Question 1: Was there a difference if the result was a literal?
So, I used optional {?loc ?p ?o} and got some results.
However, I thought since this ontology also imported RDF and OWL, the following queries should work too, but it was not the case (hopefully these codes are correct)!
optional {?value rdfs:range ?loc}
optional {?loc rdfs:domain ?value}
optional {?loc rdf:type ?value}
Question 2 If the above queries are correct, are RDFS and OWL just a suggestion? Or do ontologies that import/ follow them have to use all their resources or at least expand on them?
Thanks!
An import declaration in OWL is, for the most part, just informative. It is typically used to signal that this ontology re-uses some of the concepts defined in the target (for example, it could define some additional subclasses of classes defined in the target data).
Whether the import results in any additional data being loaded into your dataset depends on what database/API/reasoner you use to process the ontology. Most tools don't automatically load the targets of import declarations, by default, so the presence or absence of the import-declaration will have no influence on what your queries return.
I thought since this ontology also imported RDF and OWL, the following queries should work too, but it was not the case (hopefully
these codes are correct)!
optional {?value rdfs:range ?loc}
optional {?loc rdfs:domain ?value}
optional {?loc rdfs:type ?value}
It's rdf:type, not rdfs:type. Apart from that, each of these individually look fine. However, judging from your broader query, ?loc is usually not a property, but a property value. Property values don't have domains and ranges. You could query for something like this, possibly:
optional { fma:location rdfs:domain ?value}
This asks "if the property fma:location has a domain declaration, return that declaration and bind it to the ?value variable".
More generally, whether these queries return any results has little or nothing to do with what import declaration are present in your ontology. If your ontology contains a range declaration for a property, the first pattern will return a result. If it contains a domain declaration, the second one will return a result.
And finally, if your ontology contains an instance of some class, the third pattern (corrected) will return a result. It's as simple as that.
There is no magic here: the query only returns what is present in your dataset. What is present in your dataset is determined by how you have loaded the data into your database, and (optionally) what form of reasoner you have enabled on top of your database.

How to get specific datatype property using SPARQL in Protege

This is my ontology:
I have an individual and I set two different datatype properties: "code" and "EnglishName". In SPARQL, I can get all datatype properties of individual:
Query:
SELECT ?x ?y WHERE { uni:舌苔厚度厚 ?x ?y. ?x a owl:DatatypeProperty}
The question:
Why every datatype property appears twice in the result?
If I want to get value of one datatype property (not of all datatype properties), what do I suppose to do?
Thanks.

SPARQL for Pizza Ontology CQ

so I used pizza ontology, and tried to do some competence questions below using SPARQL
what kind of pizza are there?
is topping necessary for pizza?
here's what I've come to understand so far:
I try to list pizza's subclasses by this query
SELECT ?p WHERE { ?p rdfs:subClassOf pizza:Pizza }
it only display NamedPizza, I understand why, because NamedPizza is the only class that has direct subclass relationship, meanwhile other class like CheeseyPizza is owlEquivalentClass with certain attributes, and American is subclass of NamedPizza. So, if I want to list all the kind of pizza in this ontology, including CheeseyPizza and American, what query will it be?
related to 2nd question, since Pizza definition is only mention that it has PizzaBase, and PizzaTopping only mentioned in specific Pizza subclass/equivalentClass (for example: CheeseyPizza has CheeseTopping), how to test that a Pizza must have PizzaTopping or not?

DBPedia - Most relevant predicates per resource

I'd like to determine the most relevant properties / predicates (not objects) for any resource in DBPedia and Yago (e.g. the top 20). For instance, intuitively for a music artist you would be interested in his age, genre, music label, records etc.
What should a good algorithm look like to solve this problem?
My current naive approach is the following.
First I retreive all classes, ordered by their "size". (Warning, very expensive query!)
SELECT distinct ?class (count(distinct ?e) as ?c)
WHERE {
?e rdf:type ?class .
}
ORDER BY DESC(?c)
Then I make a query for each of those classes to get the number of entities within that class that have that certain property.
SELECT distinct ?prop (count(distinct ?e) as ?c)
WHERE {
?e rdf:type <--CLASS--> .
?e ?prop []
}
ORDER BY DESC(?c)
<--CLASS--> is replaced by the URI of the respective class. After some post-processing this gives me a list like this:
"dbo:Agent": {
"count": 1974654,
"properties": {
"http://www.w3.org/1999/02/22-rdf-syntax-ns#type": 399948,
"http://www.w3.org/2002/07/owl#sameAs": 67799,
"dbp:name": 22272,
"dbp:hasPhotoCollection": 13122,
"http://xmlns.com/foaf/0.1/givenName": 10799,
"dbo:birthPlace": 10055,
"dbo:birthDate": 9953,
"dbo:birthYear": 9735
}
},
"dbo:Person": {
count:
...
It tells me, which properties are most relevant for which class. Of course "meta" properties like http://www.w3.org/2002/07/owl#sameAs should be ignored in a later step.
However, entities are in multiple classes and potentially every of those is important and gives additional information. E.g. dbr:John_Lennon is (among others) in dbo:Person and dbo:MusicalArtist. I need to combine these classes' property rankings. I thought of the following approach, but I'm unsure if this is actually a reasonable solution.
So my idea was to compute relative weights for every property (e.g. propX in classA) by dividing the number of entities within classA that have propX by the total number of properties in classA. If I want to merge two classes then, e.g. classA and classB (or Person and MusicalArtist), I'd simply rank the properties of both classes in combination, ordered by their relative weights (is this a legit comparison?). If a property occurs in both classes, I'd compute the harmonic mean over both for the ranking.
Assuming the above steps would actually make sense (please let me know what you think), I got one more problem. I want to combine information from DBPedia and Yago, so for dbr:John_Lennon I want to fetch the equivalent (owl:sameAs) yr:John_Lennon from Yago. How can I merge the property ranking from both datasets to finally get a list of top 20 most relevant properties consisting of a mix of both DBP and Yago properties?

SPARQL query on Protege 4.3

The following query (preceded by relevant prefixes of course)
posed on an ontology (.owl file) gives the object properties or data properties?
SELECT DISTINCT ?predicate
WHERE { ?subject ?predicate ?object }
Thank you,
It wholly depends on what the data in the triple store or file contains. Variable ?predicate will match the predicate of the triple, and that predicate might be a datatype property or an object property in OWL, or neither of those if you're not querying an OWL ontology. Likewise, ?object will match an RDF resource or a literal, again depending on what the data says.