Originally posted by: Casey_B
Oracle should be pretty good about recovering from a forced exit.
Most enterprise level databases have transaction logging to be able to recover
the consistency of the database when killed.
From personal experience, I seem to remember that db2 was pretty good at recovering after being killed.
(In my clusters, if you didn't stop normally, you were killed, and shared memory removed, etc...)
db2 needed to be started with a flag to use those logs, and recover...
Maybe your Oracle start scripts aren't set up with the right flags to recover the database?
Back to a work around for your problem with the current design of HACMP....
I still think editing the HACMP scripts is the wrong thing to do.
Maybe, if your application stop script is going to fail....and fail in a way that you don't want
any further processing to occur....
Then maybe you send a page, email, etc, print a big message to the logs (The big message is important, so that you don't
have a co-worker forget what you had configured)...and stop the script from completing.
Maybe something like this:
echo "ERROR ERROR, stopping script execution"
read
or maybe
echo "ERROR ERROR....House is falling down"
sleep 99999
The script would wait for input on the standard in that it will never get.
Or it would wait for a very long time.
The cluster would wait for the script and enter into "config too long"
Why do I say "maybe" so many times? :)
Although I can understand making a choice for manual interventions...
There are some real dangers with not continuing with the fallover.
Even if the other node was able to start the application, and continue running...it wouldn't.
- This means possible longer times to recovery.
The application has not been killed, but is not running well.
- With the application not killed, it could be accepting incoming connections, not able to write to the
disk, and just losing data.
You have to evaluate anything I say with understanding of your environment, and your application.
Hope this helps
Casey
PS. Can I move some of this information into the PowerHA forum?