Posts tagged as:

IT

Cloud Computing (aka Internet based Services)

by Carl Belso on January 3, 2010

…and that Cloud looks like a highly redundant, geographically dispersed computer cluster…

Technology marches forward and the paradigm changes. The latest new paradigm – Cloud Computing! Cloud Computing is going to completely change the way we design our computing infrastructures, again.

Cloud Computing is the concept of using Internet services to support users. In general the term is applied more toward business users, but I would say that we all use Internet (or cloud) based services. One of the best and most successful cloud computing services is web-based email or webmail. Gmail, Hotmail, and Yahoo Mail are all examples of webmail services – and these services are examples of Cloud Computing.

There are also some specific types of cloud computing models.  These are basic subsets of cloud services and are different in the sense that they are usually purchased and have SLA (Service Level Agreements) associated with them.  The models are Software as a Service (SaaS), Infrastructure as a Service (IaaS), and Platforms as a Service (PaaS).

SaaS

Why do it in the Cloud?

In general cloud based services are more robust than similar in-house solutions. Yes, cloud services fail, but probably less often than similar services a business could deploy. Gmail for example has very high availability numbers. In fact, Gmail claims 99.9% availability, which is about an hour of down time per year,  less than 5 minutes a month.

Cloud computing is also less expensive.  Using cloud based services can save a lot of money in terms of capital investments and labor. Consider the costs of running an internal email server – hardware, software, system administration, networking, spam management, and so on. The costs of cloud based solution, the cost of the service and maybe a system administrator to help the users.

Why not to do it in the Cloud…

There are some very serious risks associated with Cloud usage. First and foremost is security. Cloud computing require you to rely on the cloud provider to address security.  Add to this the fact that the service is Internet accessible and security becomes a challenge. Recently Twitter felt the effects of failed security in its usage of cloud service. Without the benefits of hiding behind a firewall, cloud based services are more susceptible to hacking simply due to accessibility. It is critical to educate your users of these new challenges.

Another potential issue is data portability. Can you get your data out of the cloud? Unluckily, the answer is usually no. The ability to extract your data for archiving or backups is an important consideration when deciding whether or not to use cloud services and when selecting a provider. A recent move by Google to address data portability is a move in the right direction and will hopefully encourage others to follow suit. But the standard answer for most cloud computing solutions is “all your data are belong to us.

Welcome to the Future…

And Cloud Computing is there. The use of cloud computing is almost completely ubiquitous. Most Internet user take advantage of cloud services daily.  In fact, I suspect that most web usage is primarily focused on the use of cloud service; email, work-related activities, blogs, etc. The Cloud is the Internet and Cloud Computing is its new purpose.

What is next? More Cloud Computing power! Google Chrome OS? Maybe…

Post to Twitter Twitter Post to Delicious Delicious Post to Digg Digg Post to Facebook Facebook Post to Reddit Reddit Post to StumbleUpon StumbleUpon

{ 3 comments }

Network Management – The Art of Enterprise Monitoring

by Carl Belso on November 30, 2009

“That which is not monitored is not managed.” - A wise System Administrator

MonitorFailure is a part of life. This is especially true in the world of IT. It is not a question of if, but when. The key to successful enterprise management is to know when things fail. This can only be accomplished through monitoring. The name of this art – Network Management. Well, actually it is more than network management, but I guess this is the title it gets because of its roots; kind of like the way we still say we are dialing the phone.  A better name would be – Enterprise Management, which is starting to catch on, but the old IT folks won’t understand what you are talking about.

Monitoring the Enterprise

If a tree falls in the forest and no one hears it, did it really fall?  Well, I don’t know the answer to that, but I do know that if a system fails, someone is going to hear it. The goal of the IT staff is to be the first one to hear it (or better yet, know that it is going to fall/fail). Nothing is more painful than having your customer point to a fallen tree and ask you if you heard it. You must listen to your forest (Enterprise).

Referring back to the Enterprise Architecture post, I view systems as collections of services. Going a bit further, services are composed of components. These components are computers, switches, storage (SAN, NAS), and a bunch of other stuff. Therefore; systems are composed of services, and services are composed of components. And, the collection of our systems, services, and components is our Enterprise.

So, what do we monitor? Simple, as much as we can – systems, services, and components. To simplify this discussion, let’s look at this in terms of levels; Systems, Services, and Components. At the top level, Systems, we are checking for functionality. For example, if the system was a website, we could perform an HTTP get to check the functionality of the site. To accomplish more detailed monitoring we might craft a special HTTP request that would exercise the services that make up the site. The data returned from this HTTP request could then be analyzed to determine if the site was operating normally. In the case of Services, well, basically we are doing the same thing, keeping in mind that services are systems. So looking at the website again, we connect to its database and run some queries to gather status information. For the Components we can use SNMP to collect a whole bunch of data. In the case of a computer we would collect CPU data, disk information, memory usage, process data, and more. For switches and routers; system performance (CPU and memory), port information (usage, up/down status, error counts, etc.), and routing data (updates, errors, etc.).  The more data we collect, the more likely we are to identify issues.

Relationships -

At each level we are collecting data that will be use to determine the operational status of all of the parts that make up a system; as well monitoring the system itself. Why not just monitor the system? Monitoring all of the supporting services and components allows us to quickly address the actual cause of a problem. If we know what caused the problem, we can fix it. The website is down is not enough information. Is the actual problem the server, switch, load balancer, router, database, firewall, or the user? Not knowing leaves a lot of things to check. If we are monitoring all of the supporting services and components we will know what is wrong and our efforts to fix the problem can be focused on what is broken.

When we monitor the Enterprise in terms of its architecture, we do not really need the “system” level monitoring, because we will be monitoring all of the services that comprise the system. Well, ok this is not really completely true. The point is that in most cases, if we are monitoring components and services, problems that affect systems will be identified at a lower level. And if the relationship of services to systems is know, and maybe even incorporated into or monitoring tool(s), we will understand why a system is down based on lower level issues. This is the goal – service and component level monitoring of the systems, for the whole Enterprise.

Reality –

All right, here is the truth, you can never get there. The problem you run into is Zeno’s paradox of Achilles and the tortoise. Once you monitor half of the stuff, there is still half to go. You can get close, but close is all you can do. That being said, it is well worth the effort. Most basic stuff can be addressed quickly and easily with open source tools like Zenoss and Nagios. And as your monitoring solution matures it will become more effective. Just keep in mind that the task is never done. The more you monitor, the less you will miss.

Navstar has successfully implemented Enterprise level monitoring and capacity planning solutions for our customers. At the Department of Treasury we implemented a complete solution based on open source tools. At the core of the solution are Zenoss and Cacti. We can help you get there too.

So what are you waiting on? Start monitoring.

- Carl

Relationships -

The point is that at each level we are collecting data that will be use to determine the operational status of all of the parts that make up a system; as well monitoring the system itself. Why not just monitor the system? Monitoring all of the supporting services and components allows us to quickly address the actual cause of a problem. If we know what caused the problem, we can fix it. The website is down is not enough information. Is the actual problem the server, switch, load balancer, router, database, firewall, or the user? Not knowing leaves a lot of things to check. If we are monitoring all of the supporting services and components we will know what is wrong and our efforts to fix the problem can be focused on what is broken.

When we monitor the Enterprise in terms of its architecture, we do not really need the “system” level monitoring, because we will be monitoring all of the services that comprise the system. Well, ok this is not really completely true. The point is that in most cases, if we are monitoring components and services, problems that affect systems will be identified at a lower level. And if the relationship of services to systems is know, and maybe even incorporated into or monitoring tool(s), we will understand why a system is down based on lower level issues. This is the goal – service/component level monitoring of the Enterprise systems.

Reality –

All right, here is the truth, you can never get there. The problem you run into is Zeno’s paradox of Achilles and the tortoise. Once you monitor half of the stuff, there is still half to go. You can get close, but close is all you can do. That being said, it is well worth the effort. Most basic stuff can be addressed quickly and easily with open source tools like Zenoss and Nagios. And as your monitoring solution matures it will become more effective. Just keep in mind that the task is never done. The more you monitor, the less you will miss.

Navstar has successfully implemented Enterprise level monitoring and capacity planning solutions for our customers. At the Department of Treasury we implemented a complete solution based on open source tools. At the core of the solution are Zenoss and Cacti. We can help you get there too.

So what are you waiting on? Start monitoring.

- Carl

Post to Twitter Twitter Post to Delicious Delicious Post to Digg Digg Post to Facebook Facebook Post to Reddit Reddit Post to StumbleUpon StumbleUpon

{ 0 comments }

What is Enterprise Architecture?

November 18, 2009

I typically try to avoid “buzz” terms like Enterprise Architecture, but sometimes you just need a term. This is one such case. It is either use a term, or paragraphs of explanations. This is intended to be the paragraphs to define the term, Enterprise Architecture.

The term Enterprise Architecture (EA) is not well [...]

Read the full article →

Green IT Solutions at Navstar

November 9, 2009

Navstar is introducing its new Green IT program and it’s ambitious as it is serious. Our mission is to partner with the Federal Government to create:
Sustainable Green IT Solutions
This includes supporting sustainable purchasing, responsible e-cycling, and greening IT operations.
As we begin this program its important that we honor the work already being done in [...]

Read the full article →
Get Adobe Flash playerPlugin by wpburn.com wordpress themes