Storing PDF In Database Or Hard Drive - sql

I have a webpage that stores Scanned PDF files. At the moment I store it in the Sql Database with as a varbinary.
I am redesigning everything from scratch and was thinking it may be more efficient to store the PDF's on the hard drive and just have a path..
The PDF collection, potentionally could get to 500gig+..
which is more efficient and what are the advantages either way?

Look into using FILESTREAM to store the files in a database-accessible way on the file system: Using FILESTREAM to Store BLOBs in the NTFS File System in SQL Server 2008

I would use a persistent store such as AmazonS3. It's cheap, replicated, and if you change your database technology at some point in the future, it will be fairly easy. Instead of storing the files directly in your database, store the URL of the PDF on your AmazonS3 storage.

Related

How to store files in sqlite3 database

is there a way to store .txt or .pdf files within a table of my sqlite3 database?
yes is possible to store file in sqlite you can see this link to find how to store file
but I suggest you to don't do that because if you store some file in database, it's become too Heavy and slow to get query
the manageable situation is your save file in file manager and store file location address in database
Yes, you can store file in database in 2 ways
Store as binary or blobs
Store file in physical and path only store in database.
But both got some disadvantages.
If binary or blobs , database get heavy and make slow performance in query.
if file path only in database, when u backup and restore database in another place, then need to move physical file also and also anybody delete file from physical folder directly.
Is your requirements is small , then got with binary.let choose yourself.

Uploading a file to a VarBinaryMax field into Windows Azure?

I'm extremely confused, so I've created an SQL Database in Windows Azure, created a "video table" with a "video_file" column as "varbinary(max)" because I want to upload a video file into that field, however Azure offers no "Upload" option like say, PHPMyAdmin does where you can hit "browse" and upload a video directly into the field. Can anyone guide me as to how to actually upload a file into a Windows Azure SQL Database so it can be read as a varbinary type? Can it be done within the Azure management portal? Or does it require some sort of external program/service?
To answer your question, the functionality to upload files directly into SQL Azure Database does not exist. This is something you have to do on your own.
Can anyone guide me as to how to actually upload a file into a Windows
Azure SQL Database so it can be read as a varbinary type?
Do a search for uploading files in SQL Server and you will find plenty of examples on how to do that. Take a look at this link for example: http://www.codeproject.com/Articles/225446/Uploading-and-downloading-files-to-from-a-SQL-Serv
Can it be done within the Azure management portal? Or does it require some sort of external program/service?
No. This functionality does not exist in Azure Management Portal. As mentioned above, you would need to write some code to do so.
A little bit off-topic comment:
May I suggest that instead of saving the image files in the database you save them in Blob Storage and store the URL of the blob in your table. There're some advantages I could see in this:
Compared to SQL Database, Azure Blob Storage is much cheaper. If you store video files (or in other words large files) in the database, you will end up with large database and thus end up paying more money.
You will be choking the database when reading this large data from the database which will impact the performance of your application.

Will using FileStream in SQL Server increase database size?

I am using AppHarbor for hosting my website and currently it offers only 20MB free space. I want to store PDF files using SQL Server's FileStream column. Would using FileStream increase the database size because FileStream basically stores on hard disk as I know.
Any help would be appreciated.
Because the FILESTREAM data becomes part of the database, in the sense that it has its own FILEGROUP(s) and can (should be) be backed up through SQL backups - I would answer yes.
No it does not increase the size of the database as the files are not stored in the Database.
Please read the following "FILESTREAM Design and Implementation Considerations" --google is your best friend

Storing files in SQL server vs something like Amazon S3

Whats the advantage/disadvantage between storing files as a byte array in a SQL table and using something like Amazon S3 to store them? Whats the advantage of S3 that makes it so I should use that instead of SQL?
Pros for storing files in the database:
transactional consistency
security (assuming you need it and that your database isn't wide open anyway)
Cons for storing files in the database:
much larger database files + backups (which can be costly if you are hosting on someone else's storage)
much more difficult to debug (you can't say "SELECT doc FROM table" in Management Studio and have Word pop up)
more difficult to present the documents to users (and allow them to upload) - instead of just presenting a link to a file on the file system, you must build an app that takes the file and stores it in the database, and pulls the file from the database to present it to the user.
typically, database file storage and I/O are charged at a much higher premium that flat file storage

Uploading pictures to a path VS database

I am about to create an ASP.NET MVC app which will have over 2000 products and each products will approx 20 photos. The app will be asp.net mvc app and
I am using sql server 2008 r2 to manage my data. which way is the better approcah here;
Uploading pictures to a path and
storing their file names to database
in order to be able to make a
relation to each other.
Storing pictures inside the database
as byte as well and retreive them
from there when needded.
definitely in the filesystem (store path) is better, i have done both in the past.
Against SQL server to store images
A) betting data in and out can be more difficult as have to used blob type objects and some ORMs don't really cater for this
B) your data base is much bigger so effects your backup/restore policy. The more frequently you backup the better but space will be increased. Storing in file, yep you still need to backup but backing up filesystem is easy.
C) when you run out of storage space you just add another NAS drive / server and start storing images there, so scales horizontally
The common perception is not as good as data stored in two places but for me its better as the type of data in stored in the best storage medium for the data types ..
Definitely storing as a path rather than the byte array. This means you can easily change the actual image itself without having to alter any code or muck around in SQL (as long s the new file has the same name as the old one).
Hope this helps.
In the database using FILESTREAM which combines the 2 ideas (file and database)
FILESTREAM integrates the SQL Server Database Engine with an NTFS file system by storing varbinary(max) binary large object (BLOB) data as files on the file system. Transact-SQL statements can insert, update, query, search, and back up FILESTREAM data. Win32 file system interfaces provide streaming access to the data.
This changes the file vs database arguments
If you want to store paths only, then you'll have to accept the fact that images and database will get out of synch over time.