NoSQL for Windows Server, Azure & SQL Server with Apache Hadoop
Microsoft promoted the Roadmap about the subject “Big Data” lately at the SQLPass. The Apache Projekt Hadoop will be a main part of this program.
Hadoop? Mhh….?
Hadoop is a Framework or a System which includes different components. The aim is to conduct and analyze huge (and also unsorted) files.
The project includes these subprojects:
· Hadoop Common: The common utilities that support the other Hadoop subprojects.
· Hadoop Distributed File System (HDFS™): A distributed file system that provides high-throughput access to application data.
· Hadoop MapReduce: A software framework for distributed processing of large data sets on compute clusters.
Other Hadoop-related projects at Apache include:
· Avro™: A data serialization system.
· Cassandra™: A scalable multi-master database with no single points of failure.
· Chukwa™: A data collection system for managing large distributed systems.
· HBase™: A scalable, distributed database that supports structured data storage for large tables.
· Hive™: A data warehouse infrastructure that provides data summarization and ad hoc querying.
· Mahout™: A Scalable machine learning and data mining library.
· Pig™: A high-level data-flow language and execution framework for parallel computation.
· ZooKeeper™: A high-performance coordination service for distributed applications.
Hadoop is developed with Java and his home is the world of Linux because of this I was surprised to hear this announcement.
Hadoop & Windows Azure/Server
According to the announcement Hadoop should be able to run on a Windows Server and it should be integrated into Windows Azure. The first Beta is planned to be published at the end of the year. Afterwards follows the going live in the next year.
What means huge files? Who use this?
Probably Facebook has the main Hadoop Cluster – in this Blogpost you will found some numbers and facts. Impressive.
Javascript is everywhere!
A little thing that makes me, as a developer, laugh (and which maybe drives a lot of DBA’s crazy):
For developers, we will enable integration with Microsoft developer tools as well as invest in making Javascript a first class language for Big Data. We will do this by making it possible to write high performance Map/Reduce jobs using Javascript. Yes, Javascript Map/Reduce, you read it right.
Ha!
Tool support
There should be connectors for the SQL server which are meant to manage the communication between the worlds of NoSQL and SQL. Even Excel and Co. should be able to use the new opportunities. More technical details you will find here.
Even if I’m not really affected by this news I think it is smart from Microsoft to look beyond their own nose. At least it has been the right decision in the past.
Keep it simple! YAGNI!
Evergreen on this blog is the Howto: 3-Tier / 3-Layers architecture. But in fact I had some doubts if this usual practice is the best way. Is a three layer architecture with an own DAL always recommendable? A few days ago I’ve read an interesting blog post which goes the same direction: Keep your code …
Use RavenDB as embedded Filebase
In my first post about this subject I’ve showed you how to start quickly with RavenDB and several ways of deployment. One option was to run RavenDB in the application – the advantage is that you don’t need a separate server even the Web-Admin-UI is able to activate because of that the usage in the …
Entrance into mongoDB
If you are interested in the trend? Subject NoSql you will also here something about MongoDB soon. What makes mongoDB interesting for me is the promised Performance (even If I didn’t reach the limits with normal data banks yet) and the promise to say goodbye to all this O/R Mapper stuff (here I reach …
Find out the Geo-coordinates of an address – geocoding with Google Maps
To find out the exact coordinates of an address there are numerous services available. Some ways goes through Javascript and the Google Maps “Plugin” others are reachable via a surface. The “smartest” (and cheapest / for free) alternative is via Google Maps Geocoding API. Request / Response The structure of the request is quite …
TeamCity & GitHub
Because of my latest fun with Git I’ve planned to connect it with the CI Tool of my choice. So what do I have to do if I want TeamCity to bring me the latest Sources? In fact it’s very easy… In the latest Version of TeamCity (at the moment it’s 6.5) the Git-Client …
WebApps easy deploying with the Windows Azure Accelerator for Web Roles
The definitely catchy name stands for a relative clever Deploymentmodell for Windows Azure. Base is the fact, that the Web Roll of Azure is able since a few versions are able to use the full IIS. That means, in one Web-Roll many websites are able to run. The configuration was a little bit bulky …
Google Dash/Dart, Go, a Cloud IDE „Brightly“ and the future of Javascript
Since a few days an embellished (?) Mail from Google buzz through the Internet and it includes some interesting Details about the future of Javascript. Also it includes an announcement for the presentation of the new language Dart on GOTO which is going to start at the 10th of October. The Problems of Javascript In …
Fix:“401 – Unauthorized: Access is denied due to invalid credentials“ on ASP.NET MVC & IIS 7
There are a lot of different reasons for this error message for example the IIS process has no access for the files or the password of the account where the application runs is wrong. All in all: it is a not very clear defined error message. Martin has already written about a very interesting problem …
NoSQL with RavenDB and ASP.NET MVC
For a loooong time (at least for me ) it was a fact that files have to be in a database. Usual files are saved in a relational database and linked. But in a while there exists resistance – NoSQL is the word. What are NoSQL database? There are several types of NoSQL database …




Recent comments