Does the pagelinks table contains redirects? - sql

I want to count how many inbound links each MediaWiki page have. I want to know if I need to consider redirects in pagelinks table or not.
MediaWiki database have pagelinks table which have all links included. This table have three attributes 'pl_from', 'pl_namespace' and 'pl_title'. I want to extract links from pagelinks.sql file from MediaWiki database dump.
Do you know if 'pl_from' can have id of redirect page and 'pl_title' can have redirect title?

For the purposes of pagelinks, redirects are treated as normal pages.
So, for example, if you have a redirect page with the following code:
#REDIRECT [[Some page]]
[[Another link]]
Then there will be two entries for this page as pl_from: one for Some page and another for Another link. What this means is that a redirect will always have a row for its target page, but it can also have other rows.
And pl_title contains the title of a linked page based on the code of the page. So, if there is a page with a link to a redirect, the title of that redirect will appear as pl_title.

Related

Canonical tag for content split across multiple pages

We have pages which have been split into multiple pages as they are too in depth. The structure currently...
Page (www.domain.com/page)
We have split this up like so...
Page + Subtitle (www.new-domain.com/page-subtitle-1)
Page + Subtitle (www.new-domain.com/page-subtitle-2)
Page + Subtitle (www.new-domain.com/page-subtitle-3)
I need to know the correct way of adding in multiple canonical tags on the original page. Is it search engine friendly to add say 3/4 canonical tags linking to 3/4 separate pages?
Well, this is what you should do -
Keep the complete page even if you are dividing into component pages.
Use rel="next" and rel="prev" links to indicate the relationship between component URLs. This markup provides a strong hint to Google that you would like it to treat these pages as a logical sequence, thus consolidating their linking properties and usually sending searchers to the first page.
In each of the component pages, add a rel="canonical" link to the original (all content) page. This will tell google about the all content page.
This strategy is recommended by google - read here.
Canonical tags are basically to consolidate link signals for duplicate or similar content. With that said, you are not supposed to have multiple canonical tags in a page. You have two options.
If your old page is going to go away, then you should pick one primary page(in the split pages) and do a 301 redirect, so the SEO value are carried over to that new primary URL.
If its going to stay, you can create internal links to the new pages. But make sure the content is different, so that it does not count as duplicate pages.
Hope this helps.

How do I setup a robots.txt which allows all pages EXCEPT the main page?

If I have a site called http://example.com, and under it I have articles, such as:
http://example.com/articles/norwegian-statoil-ceo-resigns
Basically, I don't want the text from the frontpage to show on Google results, so that when you search for "statoil ceo", you ONLY get the article itself, but not the frontpage which contains this text but is not of the article itself.
If you did that, then Google could still display your home page with a note under the link saying they couldnt crawl the page. This is because robots.txt doesnt stop a page being indexed. You could noindex the home page, though personally I wouldnt recommend it.

HTML SEO : Why my page identified as two different pages

I use sites that check out my website for SEO And I get the following message:
"his page title is not unique. Assign unique, descriptive TITLE tags and headings to every page."
accessbar.co.il
accessbar.co.il/index.aspx
But it should be the same page.
Your home page can be pulled up using two different URLs:
accessbar.co.il/
accessbar.co.il/index.aspx
They may be the same page to you, but to search engines they are two separate URLs and thus considered two separate pages. You should do one or more of the following:
do a 301 redirect from accessbar.co.il/index.aspx to accessbar.co.il/
put a canonical URL on accessbar.co.il/index.aspx pointing to accessbar.co.il/

Redirect .htaccess to different page on different domain

I tried searching for similar posts but none of them satisfied my need. Hence I have to ask a new question
I have some domain space on my university website like this <university>.edu/~<myname> which is actually <university>.edu/~<myname>/index.html
I want all the traffic from here to go to <myname>.wix.com/resume. But I don't want other pages like my <university>.edu/~<myname>/somethingelse to redirect anywhere.
Also I want the address bar to still show <university>.edu/~<myname> even when it has readirected to my site on wix.com.
Kindly let me know if this is possible.
Thanks in advance.
No browser that I know of will allow itself to be redirected to a different site, without displaying the URL of the new site in the address bar. However, there are ways that you can display content from <myname>.wix.com/resume on your page at <university>.edu/~<myname>/index.html. One way is to use a frame (or an iframe) on <myname>.wix.com/resume which would load content from <university>.edu/~<myname>/index.html. Another (perhaps more seamless) solution would be to use a server-side scripting language (such as PHP) to create a script at <university>.edu/~<myname>/index.php which would download the content 'on the fly' from <myname>.wix.com/resume and display it at <university>.edu/~<myname>/index.php.

How to index Dynamic url page

I have a site that look like this:
Main page (index.php)
where a user can research a business.
To display result at page (search/index.php), the page url will look something like this:
/search/index.php?what=plumber&where=montreal&page=1
The page will generate dynamic link to access the business profile found. When you click on one of those links, you get here:
/entreprises/index.php?companyName=Something
How can I get the previous link to be indexed on Google?
This appears to be a good situation for the canonical URL tag. You'll want to specify a single URL for the page in question, and then other variations of that url getting to the same page (i.e. searches, tags, etc.) will refer to the stated canonical url as the actual url for SEO purposes.
Google has a detailed description of the tag here:
https://support.google.com/webmasters/answer/139394?hl=en