Howto create knowledge base from Wiki, URL behind a login - qnamaker

Need to create a knowledge base from a Wiki,however, the wiki is behind a login. What is the best way to feed a login protected URL into QnAmaker?
I've tried logging in in another tab on the same browser, but QnA maker simply shows the login content. I've quickly reviewed the QnA site howtos and searched stackoverflow. I'm sure folks have solved this issue but I haven't found any guidance.
If you have a link to online solutions, that would be wonderful.
Thanks...Steve

QnAMaker uses a service on its own server to scrape FAQs, which is why you can't login yourself and then try to scrape.
If you're able to whitelist IP addresses so the QnAMaker service doesn't have to login, QnA Maker Management Service uses the following IP Ranges:
West US: 13.91.0.0/16
East US: 52.191.0.0/18
Otherwise, your FAQ would have to allow login via URL. If this is possible, it's likely something like:
http://username:password#example.com/, or
http://myserver.com/~user=username&password=mypassword
The other option would be to manually copy the site to a supported file (like PDF or Doc) and then upload it.

Related

Webflow authentication integration with private API

The company I work for has an web application built with Angular, that has user authentication.
We also have a blog built with Webflow for simplicity.
The thing is, we want to create special pages on our blog only for premium users. For that, a user would need to sign in on the blog (webflow) using the same account they use on the main web application. After that, the blog would also need to know if they should have access to said pages (is a premium user), and then allow them to access such areas.
I've been looking for information about this, but I've been unable to locate a clear answer. I tried following this, but the GET request for https://webflow.com/oauth/authorize (using my own clientID) returns their home page. This can be seen on the printscreen below:
The request has the following format: https://webflow.com/oauth/authorize/?client_id=<CLIENT_ID>&response_type=code. It redirects twice (code 301 and 302), then just returns me their homepage.
In fact, I'm not even sure this oauth integration would solve my problem. Is this even achievable using webflow?

Can qnamaker.ai load a url not open for the public?

I'm setting up a QnA bot in Teams for internal use. However our FAQ page is whitelisted by IP and only accessible from the company office. I’ve tried whitelisting all known Azure service IP-addresses to no avail.
Is it possible to grant qnamaker.ai access to the private URL? If so what IP-address should be whitelisted? Thank you in advance.
QnA Maker Management Service uses the following IP Ranges:
West US: 13.91.0.0/16
East US: 52.191.0.0/18
qnamaker.ai uses a few others, depending on where you're accessing it from. If whitelisting the above IP ranges doesn't work, let me know what region you're trying to access it from and I can provide additional guidance.

IP authentication for library

I have very abstract description of the project. Ok, publisher wants to give access to several books to some library. He gives access by IP range (common method in publishing business, as I understand). So, users who will go to the library and use library's computers and browse book catalog can download these books.
Now question. This system should work if user login to the library from home computer (actually system like this already exists and I don't understand how it can work). And problems begin. User login to the library, but he will have his own IP address obviously. What can I do to give this users access to the books if I know only IP address range of the library.
I see 2 methods:
use "HTTP referer" field from the request. Which is very insecure
use third-party authorization meсhanism. I don't know exactly how I can do it, so I will appreciate any help
It looks like I'm trying to do pretty the same thing as this user -https://stackoverflow.com/questions/18999304/using-ip-to-identify-institution-school-university-library-etc
Anyway I found several references of IP Range authorization for libraries and universities. So, somehow it works. Can you explain how does it work for people who use their home computer?

SEOstats API Google Pagerank Blocked by Google

I wrote a php script to get all the urls' pageranks of my company website. But get the following response from GetWithCurl($url) - $str. It looks like Google has some restriction to get the pagerank dynamically.
Is there any way to resolve it? or contact with google? but how. Thank you!
Sorry...GoogleSorry...We're sorry...... but your computer or network may be sending automated queries. To protect our users, we can't process your request right now.See Google Help for more information.© 2009 Google - Google Home
Well.. That is not an issue with SEOstats. The problem is that Google detected that you send automated requests to it, which is against their Terms of Services!
You should be able to "solve" this by getting a fresh IP from your provider (turn router off/on) or sending your requests through proxies. Anyway, you must decrease your request frequency to avoid getting blocked again!
See: https://github.com/eyecatchup/SEOstats/issues/33

How to check restricted access pages for broken links?

I was googling for tools for checking broken links in a remote web page. The w3c validator seemed a good one. But I am still unsure as how to check for pages which are restricted, i.e. the pages which I can only access by logging in to the site. Can we do that using the w3c validator? If not than is there any other tool for the same?
For basic authentication the online validator will proxy it and prompt you to logon, alternatively see this post.
Sometimes you can specify the login details in the URL: username:password#url.to.the.site. This I believe only will work if you are using a .htaccess file for logins.