How can I access PubMed's database using BigQuery - google-bigquery

When accessing google_patents_research.publications table from BigQuery I can filter results by the top_terms column. This column uses Google's machine learning algo to search the text inside a patent and extract the words it finds most meaningful.
I was wondering if there is a similar search using BigQuery that can be done on PubMed's database.
According to NCBI docs there is a dataset called nih-sra-datastore which should be accessible from BigQuery, but it doesn't show up when searching for it in BigQuery's console search box.

If you are referring to nih-sra-datastore as explained on this page The National Center for Biotechnology Information it looks that its making reference to the project-id not the dataset as such.
So in order to have access to it you have to follow these steps (on BigQuery explorer):
Click on +ADD DATA
Hover over Pin a Project and select Enter a Project Name
Add nih-sra-datastore
You will pin this public project to your BigQuery explorer.

Related

How to activate table preview menu in Bigquery Console?

I started BigQuery trial account just before
and created sample table using a source with Drive URI & CSV.
When I clicked my uploaded table, it shows table information like schema and details.
I remember that there was a preview menu but I can't find it now.
Is there anyway to activate table preview menu?
When your table is created from a file from Google Drive, by default "Preview" tab is not available since it is considered as an external table. I tested it to verify the behavior.
Created table sourcing from Google Drive:
What I could suggest is to load your data from Google Drive to BigQuery, so the "Preview" tab will be available. Also if you load your data in BigQuery, running queries will be faster compared from querying the table externally.

BigQuery Connected Sheets - Required user permissions?

I have a view that is connected to a google sheet via connected sheets.
I'm trying to let a user refresh the data by giving them access in GCP.
I've tried giving access at the project, dataset and view levels. But every time they get the error: "Query failed, no access to the connected BigQuery table"
I'm giving the role of bigquery.user and bigquery.dataviewer.
What could be causing this?
Please, make sure if the user you wish to give access to the BigQuery data in Google Sheets has:
An Enterprise Plus or G Suite Enterprise for Education account
Access to BigQuery
A project with billing setup in BigQuery
A BigQuery Job Creator role on the selected billing project
BigQuery Data Viewer role on the datasets containing the selected table
According to the documentation:
If you share a sheet with someone who doesn't meet the criteria
above, they'll be able to see analysis created with Connected Sheets
and perform regular Sheets operations, but they won't be able to
refresh it or create their own connected sheet.
Additionally, have a look for the another SO thread and Using Connected Sheets documentation.

SESSION_USER equivalent from Big Query in Data Studio reports

We are creating dashboards for clients using data studio.
Each client should see their data in the dashboard, based on their login credentials. It is simple to create an authorized_view in Big Query to let certain users see certain rows of an underlying shared table. But how would one achieve to then move this into a dashboard which can be shared with each client, yet show only the individuals client in the dashboard instead of the data that was visible to the report creator?
So let's say we have a large table with a bunch of columns and one column email which contains the email of users. Now, we want the dashboard to show metrics for each user based on this email column.
On DataStudio in the datasource schema review step, make sure the flag USING VIEWER’S CREDENTIALS is on. By turning it on, the query when being executed will use the viewer’s credential instead of the owner who created the report.
After you finish create proper visualization on Data Studio, final step is to share the report to eg: store managers using the share option of Data Studio which is similar to share a Google Docs. You can confidently share it with the whole organization or with the email group of eg: store managers, permission already be controlled at data level.
Read more about this topic here.

How can I customize a data set for Google BigQuery? Can I export a file? How do I test it to see if it meets my needs?

I would like to improve the quality of existing data by using the Google BigQuery API to help validate the accuracy of existing data.
I dont see information on the types of data elements contained in the BigQuery and dont understand how to use an API if I just want to see what types of data are contained in there.
I tried looking for instructions and data elements in the Google Health Care API and Google BigQuery documentation and only saw how to set up a payment option.
I am a newbie at programming and wanted to do some preliminary research on these data sets prior to bringing them to our technical team.
I expect to see a list of relevant results based on a custom query.
You can see the data types supported
by Google BigQuery here and the conversion between different types here.
Also you can try out the BigQuery APIs in the OAuthPlayGround.

google-bigquery

I am using BigQuery for SEO reasons. I am a search TC and I am a little confused why you are not using the Google Forum as I thought that was standard. What I want to use BigQuery for is to pull when my competitors change data on their website and which pages that were changed. So I need the URL that was changed and when it was changed (date) so I can also pull the page title and description to see what they are doing different than I am.
Is there anyone that knows how to use BigQuery to pull:
Date the page was changed
URL
Title
Description
We've switched to using Stack Overflow for support for many of our developer products, such as BigQuery. There's a great community here on StackOverflow, and the interface for formatting technical questions and interacting with the community is fantastic.
BigQuery does not collect the data for you-- it's a cloud service for performing ad hoc queries on massive datasets. Before first performing the queries, you need to upload the data to the service (as a CSV format).
So, if you have a job which collects this data -- URL, title, description, date and perhaps a hash of the webpage, you could potentially ingest a CSV file of this data into BigQuery and use it to understand when webpages have changed.
Of course, there are also 3rd-party services (such as Changedetection.com) which may be easier to use for your purposes.