
BIG DATA

OBJECTIVES:

-> What is big data?
-> How to handle big data?
-> Solutions for big data


BIG DATA:
Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharing, storage, transfer, visualization, and information privacy.

Big data is a massive volume of structured and unstructured data that is difficult to process. It has three main aspects:

1) velocity
2) volume
3) variety
velocity: The speed at which data arrives. Big data accumulates in very large amounts in a very short time.

volume: The size of the data. It is so large that it cannot be processed using conventional methods.

variety: Big data includes both structured and unstructured data.



How to handle big data:

Many solutions have been developed over the years to handle big data efficiently. Companies like Yahoo, Facebook, and Google, along with many people around the world, have worked on this problem. Some of the solutions available for big data are Hadoop, Pig, Hive, and Spark.

Solutions for big data:

Hadoop is an open-source project developed by Apache. It is based on an algorithm that Google described in a white paper, which explains how the search giant handles large amounts of data. The code that implements this idea was written by developers at Yahoo and by many other people all over the world.

Hadoop uses the MapReduce programming model and HDFS (the Hadoop Distributed File System) to process big data.
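
To give a rough feel for the MapReduce idea, here is a minimal word-count sketch in plain Python. It is not Hadoop code and does not use any Hadoop API. The function names (map_phase, shuffle_phase, reduce_phase, word_count) are made up for this illustration, and everything runs in one process on a small list of lines, whereas a real cluster would spread the same steps across many machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle step: group all emitted values by their key (the word)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of text lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data is big", "data is data"]
    print(word_count(sample))
```

Each line can be mapped independently of the others, and each word can be reduced independently of the others. That independence is what lets this style of processing be split across many machines.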




 



