Upgraded OpsMgr 2007 SP1 to OpsMgr 2007 R2

I did the upgrade today at work from Operations Manager 2007 SP1 to Operations Manager 2007 R2.  I followed the upgrade document by Microsoft.  It weighs in at 60 pages but that covers all sorts of scenarios.  I printed the document out and slowly worked through it.  It’s 60 pages so that’s a nice load going into the recycle bin.  It’s a very well written document so congrats to the team and the author, Christopher Fox.

I followed their precautionary backup steps.  But I also took a snapshot – yeap, our OpsMgr 2007 RMS is a virtual machine.  I checked out it’s utilisation of resources and found it was a candidate.  So I took a snapshot after the database backups/resizing and then continued the MS upgrade process.  I’ll leave the snapshot for a weekend to see how it goes.  The AVHD’s are slowing performance of the OpsMgr server but I’ll live with that in order to have a rollback option in the case of a critical issue.  If all is well in a few days I will merge the snapshot.

You need to have some disk space for this upgrade.  The guide recommends that the Operations and datawarehouse database have 50% free space.  The Operations database also needs a log file that is 50% of the size of the database file, e.g. the MDF is 50GB so the LDF should be 25GB.

The agent upgrade can be done a bunch of ways.  The easiest is to approve the updates via Pending Management in the console under Administration.  Note that agents accessed via a firewall will probably fail so you’ll have to do those manually.

I’m also using the Management Pack Templates to manage 3rd party services.  There’s a new option to target this to a group of machines.  I like that; it trims the fat.  I also use the Web Application templates.  They also need to be upgraded (edit properties, click Apply and agree to upgrade the management pack).  You’ll have to do that with each application.

Operations Manager 2007 R2 GA

OpsMgr 2007 R2 RTM’d about a month ago but was due to be GA on July 1st.  I checked that morning but Redmond doesn’t usually post things until 20:00 GMT or later.  I’ve been snowed under since and spent last night at a preview for the Bruno movie (laughed so much I started coughing my innards up).  I saw this morning Anthony Crottie posted that OpsMgr 2007 R2 is available.  I’m downloading it now from EOpen.  Check out my previous post for links to product documentation and descriptions of the new features.

Synthetic Transaction Monitoring

HP posted a blog entry where they talk about a survey they did last year – probably with very large customers.  They asked the companies if they used synthetic transaction monitoring.

What the hell is that?  The idea is that your monitoring solution can instruct an agent or agents to perform tasks against a business service to get a real world view of its health and performance.  This user perspective gives you so much more than just testing the up/down status of the web service or the SQL services.  It’s not a replacement for those monitors but it is a completion of the monitoring solution.

According to HP, “about half the respondents said they did. This ties in with a recent Aberdeen study that found 57% of companies didn’t do user experience monitoring”. 

If you’re using Microsoft System Center Operation Manager 2007 or 2007 R2 then you have the ability to do this out of the box for your web applications.  Not only that but you can also test TCP Ports and databases using OLE. 

OpsMgr includes a set of monitoring templates.  Using a simple wizard you can very quickly capture a browser session.  You will then assign this new monitor to an agent or agents and define how often it will be run.  This means that those agents will perform the web browsing session that you just captured.  If the session fails or is too slow then there will be an alert.  You can go on and build a distributed application using this new monitor.  That can combine the health of web services, SQL services, network devices and the user perspective for the application you are monitoring.  You can take this even further and use the new SLA monitoring pack to see if the application meets the availability requirements of your business or customers.

If you’re feeling really adventurous then try this.

image 

In this solution we have a web service that provides an application to customers on the Internet.  The monitoring solution is in the same site as the web service.  How can you get a users perspective on that when you don’t take the Internet into account?  The solution is simple: lease a virtual hosted server from a service provider.  Punch a hole through the firewalls to allow the encrypted agent traffic from the new hosted agent server to an OpsMgr management server.  The agent will be certificate enabled so it can reside outside of the AD forest, i.e. in a workgroup.  Now, if there is an Internet connection issue at the web server site you will get an alert from the agent.

We use synthetic web transactions at work for our customers.  We know immediately when there’s an issue.  In fact, we recently had an alert where a developer had made a mistake in a website.  Without the alert, it’s possible that this might have gone unnoticed by the hosting company for some time, thus losing customers. 

Check out the solution and test it out.  Once you’re happy add it in to your alert subscriptions and you’ll soon see how powerful this solution is.

EDIT:1

I should have added something in here.  What do you do if you’ve got no monitoring solution or if you’re in a (dedicated/virtual server) hosting environment with no monitoring, or worse, monitoring you don’t trust.  You can do something pretty similar to the above using an outsourced monitoring service.

image

In this solution the owner of a hosted web service has subscribed to an outsourced monitoring solution.  The OpsMgr server(s) perform normal agent monitoring using x.509 enabled agents or a OpsMgr Gateway on the hosted servers.  That just requires the one TCP port to be opened outbound on the hosting firewall.  The OpsMgr server also performs synthetic transactions against the web site(s).  Optionally, the customer could just get synthetic transaction monitoring to verify that the hosting operator is living up to their contracted SLA requirements.  You could even do this with a cheap’n’cheerful web hosting plan.

Service Level Dashboard 2.0 for Operations Manager 2007 R2 RTM

SLA reporting on services (“service” in the ITIL/MOF world is a service we provide to a customer/user, not a windows service) is critical in a service industry.  More businesses view their internal IT as a service.  Those of us in the hosting industry are fully aware of the contractual importance of the SLA.

OpsMgr 2007 can gather health/performance information on a “service” using a Distributed Application.  A DA assembles components, e.g. websites, SQL databases, network devices, etc in a “service” that you can alert on when it’s offline, e.g. a SQL DB in a DA going offline would bring down the customer service and thus be reflected in the status of the DA.  OpsMgr 2007 R2 can even include a hybrid of Windows, Linux and UNIX components into a single DA, e.g. a firewall running on Solaris, a MySQL database running on RedHat and a web server running on Windows.

With all this health, performance and availability information that we can store in the Reporting Database, we can generate reports and consoles for SLA reporting.  The SLA 2.0 console for OpsMgr 2007 R2 enables this.

It’s a free download so give it a look-see in your lab.  You can learn more about it here.

A New Version Of the Patch For OpsMgr 2007 IIS7 Monitoring

Microsoft released a replacement of the patch for W2008 IIS7 monitoring using Operations Manager 2007 SP1.  I personally don’t know what to think any more of this.  I’ve been burned by an update for W2008 failover clustering monitoring.  I’d been holding off on deploying the original patch for IIS7 monitoring.  Now there’s a replacement.  Ugh!

I think we need much better communications from the OpsMgr team and a clear web page (not plural) on what is required and what is not for each management pack to be supported and stable.  The management packs and their included documentation are becoming out of date within a matter of days.

Not Monitored Status In OpsMgr 2007

I put a number of servers into maintenance mode in Operations Manager 2007 today for some work.  When I terminated the maintenance mode setting a few of the servers stayed in a “Not Monitored” status in the OpsMgr Operations Console.  I rebooted one (because it’s service could tolerate 1 node being offline) and that sorted it out.  However, I couldn’t reboot any of the others.

The “Not Monitored” status usually means that the agent could not authenticate with the Management Server.  Strangely, I got no errors in the agent’s Operations Manager log.  The agent also had a green tick in the console.  After having dealt with a strange certificate issue in the past I knew there were two services to worry about on the agent machine:

  • Cryptographic Services
  • OpsMgr Health Service

Here’s how I reset the secure channel on the agent machine:

  • I put the agent machine into maintenance mode
  • I stopped the Cryptographic Services service
  • I stopped the OpsMgr Health Service service
  • I started the Cryptographic Services service
  • I started the OpsMgr Health Service service
  • I terminated maintenance mode for the agent machine

Within a minute it was back to normal and being completely monitored again.