Cluster issue in SQL Server 2000 SP4 +Windows 2003 SP2 –Help required | SQL Server Performance Forums

SQL Server Performance Forum – Threads Archive

Cluster issue in SQL Server 2000 SP4 +Windows 2003 SP2 –Help required

Dear All,
I have a issu with my A/P cluster which was build with Windows 2003R(SP2) and SQL Server 2000 SP4, every once in a month my Cluster services restarting the SQL Server services and I am unable to understand what problem it is ? can some body help me to resolve the problem.
Cluster log info
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] printODBCError: sqlstate = HYT00; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Timeout expired
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] OnlineThread: QP is not online.
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/10-20:03:07.519 ERR SQL Server : [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/10-20:03:28.550 ERR SQL Server : [sqsrvres] ODBC sqldriverconnect failed
2011/10/10-20:03:28.550 ERR SQL Server : [sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = b; message = [Microsoft][ODBC SQL Server Driver][DBNETLIB]General network error. Check your network documentation.
2011/10/10-20:03:28.550 ERR SQL Server : [sqsrvres] ODBC sqldriverconnect failed
2011/10/10-20:03:28.550 ERR SQL Server : [sqsrvres] checkODBCConnectError: sqlstate = 01000; native error = 274c; message = [Microsoft][ODBC SQL Server Driver][DBNETLIB]ConnectionOpen (PreLoginHandshake()). Please could you guys help me how to resolve this problem
Dear Forum gurus, Please help me regarding above posted issue,If requires I will share more information which ever asked.
Dear all, I same issue next day of erlier failure,My SQL Server got re-started
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = HYT00; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Timeout expired
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
2011/10/11-19:53:41.532 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Communication link failure Identified No high CPU/No high Memory/No high connections –connections are 255 only and no Blocking issue found in the server. Please some body help me to resolve the issue..
Dear Satya/Hemanth/Luis/Frank, Please some body help me or guide me..how to get rid of this problem. Thanks in advance
Snivas
Dear all, I same issue today also
Date: 11/17/2011
Time: 3:00:01 PM
Description:
18265 :
Log backed up: Database: DB_MAIN, creation date(time): 2009/07/27(04:57:49), first LSN: 1366249:15679:1,
last LSN: 1366254:4803:1, number of dump devices: 1, device information: (FILE=1, TYPE=DISK: {‘R:\MSSQLBACKUP\MAIN\DB_MAIN_tlog_201111171500.TRN’}). Date: 11/17/2011
Time: 3:06:52 PM
Description:
[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed Date: 11/17/2011
Time: 3:06:52 PM
Description:
[sqsrvres] printODBCError: sqlstate = HYT00; native error = 0; message = [Microsoft][ODBC SQL Server Driver]Timeout expired Date: 11/17/2011
Time: 3:06:52 PM
Description:
[sqsrvres] OnlineThread: QP is not online. When every I got the issue Just before the issue there are Transactional backup were happining. Can any body help me …
Did you checked if there is an indication of an issue with a hearbeat network / resource which is triggering this ? Is there a pattern that you are seeing ? i.e. every monday ? after n number of hours ?
in my recent assignment we saw the kind of issue and the culprint was the lan cable of the heartbeat network, and in another case its SAN drive issue, so please check that too.
Many Thanks Hemanth, There is no specific time and no specific period as well,most of times this failure was happened when the user are high ( almost above 400)…."select count(*) from sysprocesses where spid>50".and cpu between 25% to 40% and no change in memory. One most imp point I want mention here that is the server memory never changed after restart server.that means If my server memory 15GB before restart after restart also server memory immediatly showing 15GB. Please hemanth can you us to get rid of this problem and let us know how show heart beat problem to system team.(If I say heart beat problem they will come come again we checked ….. it is good). Regards
Snivas
I will check with Network team ansd storage team. Thanks a lot
]]>

Software Reviews | Book Reviews | FAQs | Tips | Articles | Performance Tuning | Audit | BI | Clustering | Developer | Reporting | DBA | ASP.NET Ado | Views tips | | Developer FAQs | Replication Tips | OS Tips | Misc Tips | Index Tuning Tips | Hints Tips | High Availability Tips | Hardware Tips | ETL Tips | Components Tips | Configuration Tips | App Dev Tips | OLAP Tips | Admin Tips | Software Reviews | Error | Clustering FAQs | Performance Tuning FAQs | DBA FAQs |