google rich snippet how to display job list? - google-rich-snippets

How can i let "google rich snippet" display this format such as the following?
rich snippet display the "Job Title", "Company", "Location", "Posted"
glassdoor jobs - Computerworld
25+ items - 5158+ glassdoor jobs available on Computerworld.
Job Title Company Location Posted.
Senior Software Engineer ... Riverbed Technology Sunnyvale, CA Aug 09.
Senior Java Software Engineer Glassdoor.com Manhattan, NY Aug 17.
is it use microdata, mircoformat or RDFa?
or need to write the specific HTML structure?
i know the JobPosting of microdata, but i think this format is more better to me.
Thanks for your help!

it's "bulleted snippets".
※ Use a consistent structure, whatever it is.
※ Keep extraneous code to a minimum.
※ Test removing your META description or setting it to “”.
http://moz.com/blog/how-do-i-get-googles-bulleted-snippets
http://insidesearch.blogspot.tw/2011/08/new-snippets-for-list-pages.html

Related

how to get table info and summary of page using Wikipedia api?

I want to get minimal information of a Wikipedia page using MediaWiki API like DuckDuckGo. For example for Steve Carell: https://duckduckgo.com/?q=steve+carell&t=hp&ia=news&iax=about
How can I get this information with a Wikipedia url (eg https://en.wikipedia.org/wiki/Steve_Carell) in HTML format?
You can use the MediaWiki API for that. There's an extension, TextExtracts, which is exactly for that (and it is installed on Wikipedia).
In your case, e.g.:
https://en.wikipedia.org/w/api.php?action=query&prop=extracts&exsentences=1&titles=Steve%20Carell
will return something like:
<p class=\"mw-empty-elt\">\n</p>\n\n<p class=\"mw-empty-elt\">\n \n</p>\n<p><b>Steven John Carell</b> (<span></span>; born August 16, 1962) is an American actor, comedian, producer, writer and director.</p>
You can customize how many sentences (or characters) the API returns, as well, please consult the API documentation for that.
There's also the way to retrieve the short description, which is saved at Wikidata (and visible in the mobile view of Wikipedia). This call would be:
https://en.wikipedia.org/w/api.php?action=query&prop=pageprops&titles=Steve_Carell
This returns the following property in the pageprops of the page:
"wikibase-shortdesc": "American actor"
This may fit better depending on your use case.
You can even get both of the results with a single, combined, request:
https://en.wikipedia.org/w/api.php?action=query&prop=extracts|pageprops&exsentences=1&titles=Steve_Carell

Extract data with HTMLAgilityPack – simple example

I've searched the net and can not find simple HTMLAgilityPack example to extract 1 information from webpage. Most of the examples are in C# and code convertors don't work properly. Also developer's forum wasn't helpful.
Anyways, I am trying to extract “Consumer Defensive” string from this URL “http://quotes.morningstar.com/stock/c-company-profile?t=dltr” and this text “Dollar Tree Stores, Inc., operates discount variety stores in United States and Canada. Its stores offer merchandise at fixed price of $1.00 and C$1.25. The company operates stores under the names of Dollar Tree, Deal$, Dollar Tree Canada, etc. “ from same webpage.
Tried code on this link : https://stackoverflow.com/questions/13147749/html-agility-pack-with-vb-net-parsing but GetPageHTML is not declared.
This one is in C# HTML Agility pack - parsing tables
and so on.
Thanks.
The HTML returned from that URL is translated to XML with 2 root nodes, so it can not be transformed directly to an XML document.
For the values you wish to retrieve it may be easier to simply retrieve the HTML document and search for the start and end tags of the strings you wish to extract.

How do I access the "See Also" Field in the Wiktionary API?

Many of the Wiktionary pages for Chinese Characters (Hanzi) include links at the top of the page to other similar-looking characters. I'd like to use the Wiktionary API to send a single character in the query and receive a list of similar characters as the response. Unfortunately, I can't seem to find any query that includes the "See Also" field. Is this kind of query possible?
The “see also” field is just a line of wiki code in the page source, and there is no way for the API to know that it's different from any other piece of text on the page.
If you are happy with using only the English version of Wiktionary, you can fetch the wikicode: index.php?title=太&action=raw, and then parse the result for the template also. In this case, the line you are looking for is {{also|大|犬}}.
To check if the template is used on the page at all, query the API for titles=太&prop=templates&tltemplates=Template:also
Similar templates are avilable in more language editions of Wiktionary, in case you want to use other sources than the English one. The current list is:
br:Patrom:gwelet
ca:Plantilla:vegeu
cs:Šablona:Viz
de:Vorlage:Siehe auch
el:Πρότυπο:δείτε
es:Plantilla:desambiguación
eu:Txantiloi:Esanahi desberdina
fi:Malline:katso
fr:Modèle:voir
gl:Modelo:homo
id:Templat:lihat
is:Snið:sjá einnig
it:Template:Vedi
ja:テンプレート:see
no:Mal:se også
oc:Modèl:veire
pl:Szablon:podobne
pt:Predefinição:ver também
ru:Шаблон:Cf
sk:Šablóna:See
sv:Mall:se även
It has been suggested that the WikiData project be expanded to cover Wiktionary. If and when that happens, you might be able to query theWikiData API for that kind of stuff!

Docx4J Open XML

I am reading some .docx files with Docx4J which contains hyperlinks.
I am getting URLs while clicking on those hyperlinks manually but when i am trying to read those file with Docx4J i am getting only text nothing about those Hyperlinks and URLs.
Document Text -
Infosys Chairman, KV Kamath said that IT services were facing challenges of scalability. Speaking at the 31st Annual General Meeting of the company in Bangalore, Kamath said the management has met all the challenges successfully and demonstrated leadership. Infosys, India's second largest IT services company announced a final dividend of Rs 22/share. The company also announced a special dividend of Rs 10/share on account of the 10th year of operations of the Infosys BPO. Speaking at the AGM, S D Shibulal, CEO of Infosys said that transformation is complete and the company is now focussed on growth. "Infosys 3.0 will help company address challenges," said Shibulal. Shibulal said: "We had a choice between commoditization and re-defining the industry. We chose to redefine the industry."..more
Hyperlink is on "more"
Docx4J is giving the text 'more' only. It is not giving information regarding that hyperlink.
Is there any way how to get that URL??
Please Help...

Setting options in Apache Stanbol request

I am trying to learn semantic-web where I can pass the sample text.
which is normally in English.
I am using IKS project to learn.
http://dev.iks-project.eu:8080/engines
Is there a way to set that I should return enhancments only in English and
if in sample test a word e.g London is repeated 10 Times it return also in RDF contents London 10 time. Can I get Unique suggestions back?