Any idea how to find a website written on ROR?
I mean just looking at urls ... how I can identify the written in ROR ?
Thanks for your time .
Just looking at URLs? Impossible. Most Rails apps these days have this:
<meta name="csrf-param" content="authenticity_token"/>
<meta name="csrf-token" content="y3DSVtX1Q2ZFviGXhCWCjjFaoRJImq7hJAWrZugudLU="/>
You could check the HTML source for that.
No way by looking at URLs, and I'm pretty sure it's also impossible by looking at source code since it's all HTML.
Related
I am working on one website, I have set all the required this for seo and looks all things are fine.
â–ºBut from long time, I have seen that deep product links are not going to index in Google as well some of the catagory pages.
Here is the site https://www.tradohub.com
What should I do for Google to index my web site? if someone can help.
Thanks
You have a canonical tag issue with your internal pages.
Let's take this page as an example...
https://www.tradohub.com/product-detail/raw-cashew-nuts
Reading the source code you have a canonical of...
<link rel="canonical" href="https://www.tradohub.com/" />
This tells search engines the page is a copy of the home page URL so will most likely not get indexed.
The canonical should be...
<link rel="canonical" href="https://www.tradohub.com/product-detail/raw-cashew-nuts" />
The canonical tags are the issue here so best fix them up.
Here's Google's info on the subject. https://support.google.com/webmasters/answer/139066?hl=en
Hope this helps.
William
I am using cloudflare and it is enhancing my website performance, but rocketscript is adding script at the top of page and I think
it is hurting my SEO
<meta http-equiv="X-UA-Compatible" content="IE=edge"> <script type="text/javascript">
//<![CDATA[
try{if (!window.CloudFlare) {var CloudFlare=[{verbose:0,p:1417339881,byc:0,owlid:"cf",bag2:1,mirage2:0,oracle:0,paths:{cloudflare:"/cdn-cgi/nexp/dok2v=1613a3a185/"},atok:"d1860f61ecdb8c5c3826b01316cc8742",petok:"1fe291a32e5813fc73d592d28d6e79cdc75592ce-1417361255-1800",zone:"altibbi.com",rocket:"a",apps:{}}];document.write('<script type="text/javascript" src="//ajax.cloudflare.com/cdn-cgi/nexp/dok2v=919620257c/cloudflare.min.js"><'+'\/script>');}}catch(e){};
//]]></script>
<script type="text/rocketscript">(window.NREUM||(NREUM={})).loader_config={xpid:"VQQPU1NSGwcDVVhRAAY="};window.NREUM||(NREUM={}),__nr_require=function(t,e,n){function r(n){if(!e[n]){var o=e[n]={exports:{}};t[n][0].call(o.exports,function(e){var o=t[n][1][e];return r(o?o:e)},o,o.exports)}return e[n].exports}if("function"==typeof __nr_require)return __nr_requ........
How to remove it ? And why it is adding it ?
This message is for the people jumping this old thread to get rid of CDATA code from the site.
Just go inside the Page Rules.
Create a rule for your site : http://< mysite >/*
Then add the following :
Disable Apps
Smart Errors Off
Rocket Loader Off
Under free account you can create 3 pages rules. Pretty much enough for doing stuffs.
The Rocket Loader script wouldn't really impact SEO at all. Why do you believe it is? You can always turn that feature off in your performance settings.
I have over 11,000 pages in a site and need to change the tag on every page. I interact with production through horribly slow tool so doing a global search and replace is out of the question. The site is in a .net environment and I utilize vb so I am wondering if I could rewrite the tag sitewide on the fly with a class?
Thanks
Yes. It's called response filtering. Please take a look at this:
http://msdn.microsoft.com/en-us/library/system.web.httpresponse.filter.aspx
For example when you Google "stackoverflow" the first result links to http://stackoverflow.com with a description, "A language-independent collaboratively edited question and answer site for programmers."
How is Stack Overflow or any other site working to set that description? I am about to put a website online and would like to get a good description of my site indexed. What steps do I need to take to accomplish this?
One way of achieving this is by using the <meta name="Description"> tag as such:
<head>
<meta name="Description"
content="A language-independent collaboratively edited question and answer site for programmers.">
</head>
On SO, that tag seems to appear only when you are using a user agent related to a crawler.
It's a bit hit and miss as to whether Google displays it but these are set using the 'description' meta tag on each page. If Google doesn't think your entire page is relevant to the search query, it will show a 'snippet' instead. Keep your meta description short and relevant (and non-spammy) and it should show within a day or so of being crawled by Google.
Hope that helps
Google has a help page on the subject of site descriptions.
For your own site, you may also want to add a site map for google and others to use.
I have recently started using Google Webmaster Tools.
I was quite surprised to see just how many links google is trying to index.
http://www.example.com/?c=123
http://www.example.com/?c=82
http://www.example.com/?c=234
http://www.example.com/?c=991
These are all campaigns that exist as links from partner sites.
For right now they're all being denied by my robots file until the site is complete - as is EVERY page on the site.
I'm wondering what is the best approach to deal with links like this is - before I make my robots.txt file less restrictive.
I'm concerned that they will be treated as different URLS and start appearing in google's search results. They all correspond to the same page - give or take. I dont want people finding them as they are and clicking on them.
By best idea so far is to render a page that contains a query string as follows :
// DO NOT TRY THIS AT HOME. See edit below
<% if (Request.QueryString != "") { %>
<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">
<% } %>
Do I need to do this? Is this the best approach?
Edit: This turns out NOT TO BE A GOOD APPROACH. It turns out that Google is seeing NOINDEX on a page that has the same content as another page that does not have NOINDEX. Apparently it figures they're the same thing and the NOINDEX takes precedence. My site completely disappeared from Google as a result. Caveat: it could have been something else i did at the same time, but i wouldn't risk this approach.
This is the sort of thing that rel="canonical" was designed for. Google posted a blog article about it.
Yes, Google would interprete them as different URLs.
Depending on your webserver you could use a rewrite filter to remove the parameter for search engines, eg url rewrite filter for Tomcat, or mod rewrite for Apache.
Personally I'd just redirect to the same page with the tracking parameter removed.
That seems like the best approach unless the page exists in it's own folder in which case you can modify the robots.txt file just to ignore that folder.
For resources that should not be indexed I prefer to do a simple return in the page load:
if (IsBot(Request.UserAgent)
return;