Channel: File Services and Storage forum

Server 2008 R2, iSCSI target lost causing SAP application crash on production server


Hello all,

Hopefully this is in the right section! :) Last Monday we had a serious issue on our main production server. We're running Windows Server 2008 R2 (virtualised through Hyper-V 2008 R2), with iSCSI connections to our PS6000 and PS6100 Dell EqualLogic storage. For some reason, that night the server lost its connection to an iSCSI disk that contains all the .exe files for SAP, causing the program to crash. I'm currently out of ideas and was hoping to get some input from others here =D.


Windows Server 2008R2 SP1 with the latest Windows updates installed.
iSCSI initiator driver version 6.1.7601.18386
EqualLogic firmware: V7.0.9 (R400081)
EqualLogic HIT tools on the server: 4.7.1
CommVault backup was running at that time (it is always scheduled for that time frame)


ID 29 1:53:26: Target rejected an iSCSI PDU sent by the initiator. Dump data contains the rejected PDU.
<EventData>
<Data>\Device\RaidPort0</Data><Binary>0000C00001000000000000001D0000C00000000000000000000000000000000000000000000000003F800400000000300000000000000000FFFFFFFF0000000000006803000067FF0000681F000000000000000000000000690071006E002E0032003000300031002D00300035002E0063006F006D002E0065007100750061006C006C006F006700690063003A0030002D003800610030003900300036002D006200340032003100650063003900300035002D0036003200380030003000300066003100650037006500340065006300340064002D006E006C006F006F0073007000720064002D00</Binary></EventData>
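
The tail of the dump data above looks like UTF-16LE text (every second byte is 00), so the target name can be read straight out of it. Here is a minimal Python sketch, assuming that encoding; the helper name `extract_utf16_text` is hypothetical, and the sample is only the start of the readable part of the `<Binary>` value:

```python
def extract_utf16_text(hex_dump: str, marker: str = "iqn.") -> str:
    """Return the UTF-16LE text that starts at `marker` inside a hex dump.

    Assumption: the name is stored as UTF-16LE, as the 00 bytes suggest.
    """
    raw = bytes.fromhex(hex_dump.strip())
    start = raw.find(marker.encode("utf-16-le"))
    if start < 0:
        return ""
    tail = raw[start:]
    # Drop a dangling odd byte so a truncated dump still decodes.
    tail = tail[: len(tail) - (len(tail) % 2)]
    # Stop at the first NUL character if the string is terminated.
    return tail.decode("utf-16-le", errors="replace").split("\x00", 1)[0]


# Start of the readable part of the <Binary> value from event ID 29.
sample = (
    "690071006E002E0032003000300031002D00300035002E0063006F006D002E00"
    "65007100750061006C006C006F006700690063003A00"
)
print(extract_utf16_text(sample))  # prints iqn.2001-05.com.equallogic:
```

Feeding the full `<Binary>` string to the same function should return the whole (truncated) target name, which identifies the volume that rejected the PDU.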



ID 32 1:53:26: Initiator received an asynchronous logout message. The target name is given in the dump data.
ID 23 1:53:26: All paths have failed. \Device\MPIODisk3 will be removed.
ID 16 1:53:26: A fail-over on \Device\MPIODisk3 occurred.
ID 15 1:57:26: The device, \Device\Harddisk4\DR4, is not ready for access yet.
ID 12293 1:57:27: Volume Shadow Copy Service error: Error calling a routine on a Shadow Copy Provider {b5946137-7b9f-4925-af80-51abd60b20d5}. Routine details IVssSnapshotProvider::QueryVolumesSupportedForSnapshots(ProviderId,8388617,...) [hr = 0x80042302, A Volume Shadow Copy Service component encountered an unexpected error. Check the Application event log for more information.

What caused our SAP process to stop working:

ID 1000 1:57:47: Faulting application name: disp+work.EXE, version: 0.0.0.0, time stamp: 0x53594b41

Any help to point me in the right direction would be greatly appreciated!

Kind regards,
Dennis Lans


DFSR replication stopped between sites after all servers updated (Event 1202)


Hello,

I would greatly appreciate any help with this one.

I have been working on it for 2 days without success (I have read many threads, but none helped).

So, the facts:

I have 2 AD domain controllers (2008 R2) on site 1 and 2 AD domain controllers (2008 R2) on site 2.
I also have 2 file servers (2008) on site 1 and 2 file servers (2008 R2) on site 2.

The file servers run DFS.
The DFS namespace is hosted on all the AD domain controllers.
DFS Replication and the shares are on all the file servers.

After updating all my servers, I ran into a big problem.

Communication between the file servers and the AD domain controllers of site 2 isn't working properly now.
As a result, DFSR no longer works between site 1 (where everything seems fine) and site 2.

DFSR on site 2 logs these events all the time:

-----------------------------------------------------------------

Event 1202 - Source DFSR

The DFS Replication service failed to contact domain controller  to access configuration information. Replication is stopped. The service will try again during the next configuration polling cycle, which will occur in 60 minutes. This event can be caused by TCP/IP connectivity, firewall, Active Directory Domain Services, or DNS issues.
 
Additional Information:
Error: 160 (One or more arguments are not correct.)

-------------------

Event 1055 - Source GroupPolicy

The processing of Group Policy failed. Windows could not resolve the computer name. This could be caused by one of more of the following:
a) Name Resolution failure on the current domain controller.
b) Active Directory Replication Latency (an account created on another domain controller has not replicated to the current domain controller).

------------------------------------------------------------------

When I check C:\Windows\debug\Dfsr01000.log:

20141128 11:10:46.731 1568 CFAD   311 Config::AdConnection::Connect Binding to dcAddr:\\10.98.131.178 dcDnsName:\\strmdc11.m1.streaming.win.p.fti.net
20141128 11:10:46.731 1568 CFAD   143 Config::AdConnection::BindToAd Trying to connect. hostName:strmdc11.m1.streaming.win.p.fti.net
20141128 11:10:46.731 1568 CFAD   162 Config::AdConnection::BindToAd Bound. hostName:strmdc11.m1.streaming.win.p.fti.net
20141128 11:10:46.731 1568 CFAD   199 Config::AdConnection::BindToDc Try to bind. hostName:\\strmdc11.m1.streaming.win.p.fti.net domainName:<null>
20141128 11:11:07.744 1568 CFAD  3371 [ERROR] Config::DsSession::Bind Failed to DsBind(). dc:\\strmdc11.m1.streaming.win.p.fti.net domainName:<null> Error:1722
20141128 11:11:07.744 1568 CFAD   215 Config::AdConnection::BindToDc (Ignored) Failed to bind. hostName:\\strmdc11.m1.streaming.win.p.fti.net domainName:<null> Error:[Error:1722(0x6ba) Config::DsSession::Bind ad.cpp:3378 1568 W The RPC server is unavailable.]
20141128 11:11:07.744 1568 CFAD   199 Config::AdConnection::BindToDc Try to bind. hostName:\\10.98.131.178 domainName:<null>
20141128 11:11:07.744 1568 CFAD  3371 [ERROR] Config::DsSession::Bind Failed to DsBind(). dc:\\10.98.131.178 domainName:<null> Error:87
20141128 11:11:07.744 1568 CFAD   215 Config::AdConnection::BindToDc (Ignored) Failed to bind. hostName:\\10.98.131.178 domainName:<null> Error:[Error:87(0x57) Config::DsSession::Bind ad.cpp:3378 1568 W The parameter is incorrect.]
20141128 11:11:07.744 1568 SCFS   150 [WARN] ServiceConfig::DsPollIsDue Failed to enable lightweight polling. Error:
+    [Error:160(0xa0) Config::AdConfig::ConnectToLocalDc ad.cpp:8399 1568 W One or more arguments are not correct.]
+    [Error:160(0xa0) Config::AdConfig::Connect ad.cpp:8147 1568 W One or more arguments are not correct.]
+    [Error:160(0xa0) Config::AdConnection::Connect adconnection.cpp:377 1568 W One or more arguments are not correct.]
+    [Error:160(0xa0) Config::AdConnection::BindToDc adconnection.cpp:226 1568 W One or more arguments are not correct.]
20141128 11:11:34.280 2156 SRTR   660 SERVER_EstablishConnection Replication group not found. connId:{34D20D20-54F9-4CDD-91A7-498ED802E40B} rgId:{484434EE-17D1-493A-84FE-2AEF7F696990}
20141128 11:11:34.280 2156 SRTR   841 [WARN] SERVER_EstablishConnection Failed to establish an outbound connection. connId:{34D20D20-54F9-4CDD-91A7-498ED802E40B} rgId:{484434EE-17D1-493A-84FE-2AEF7F696990} rgName: Error:[Error:9026(0x2342) SERVER_EstablishConnection servertransport.cpp:662 2156 C The connection is invalid]
20141128 11:11:36.558 2156 SRTR   660 SERVER_EstablishConnection Replication group not found. connId:{BE0E6769-B99B-4F16-83D6-DFDF635CF0E8} rgId:{A134B92B-3851-4690-A46F-BC103B9D74B9}
20141128 11:11:36.558 2156 SRTR   841 [WARN] SERVER_EstablishConnection Failed to establish an outbound connection. connId:{BE0E6769-B99B-4F16-83D6-DFDF635CF0E8} rgId:{A134B92B-3851-4690-A46F-BC103B9D74B9} rgName: Error:[Error:9026(0x2342) SERVER_EstablishConnection servertransport.cpp:662 2156 C The connection is invalid]
20141128 11:11:38.508 2156 SRTR   660 SERVER_EstablishConnection Replication group not found. connId:{411142A7-E0DE-43C5-92BD-7A72EAB63EA0} rgId:{684825B6-8BD3-4FD2-9DDF-3BF8368BFCB3}
20141128 11:11:38.508 2156 SRTR   841 [WARN] SERVER_EstablishConnection Failed to establish an outbound connection. connId:{411142A7-E0DE-43C5-92BD-7A72EAB63EA0} rgId:{684825B6-8BD3-4FD2-9DDF-3BF8368BFCB3} rgName: Error:[Error:9026(0x2342) SERVER_EstablishConnection servertransport.cpp:662 2156 C The connection is invalid]
20141128 11:11:44.467 2156 SRTR   660 SERVER_EstablishConnection Replication group not found. connId:{954E8C2E-2985-4BE5-8776-0751B6426918} rgId:{C7C06C94-36E2-4224-B811-A9970616C6F0}
20141128 11:11:44.467 2156 SRTR   841 [WARN] SERVER_EstablishConnection Failed to establish an outbound connection. connId:{954E8C2E-2985-4BE5-8776-0751B6426918} rgId:{C7C06C94-36E2-4224-B811-A9970616C6F0} rgName: Error:[Error:9026(0x2342) SERVER_EstablishConnection servertransport.cpp:662 2156 C The connection is invalid]
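
To see which error codes dominate a longer debug log, the lines can be tallied. This is a rough Python sketch, not a DFSR tool: the function name `tally_error_codes` is hypothetical, and the regular expression only assumes the `Error:<number>` and `Error:<number>(0x..)` forms visible in the excerpt above.

```python
import re
from collections import Counter

# Matches "Error:1722" as well as "Error:1722(0x6ba)".
ERROR_RE = re.compile(r"Error:(\d+)(?:\(0x[0-9a-fA-F]+\))?")


def tally_error_codes(log_text: str) -> Counter:
    """Count how often each numeric error code appears in the log text."""
    return Counter(int(code) for code in ERROR_RE.findall(log_text))


# Three lines copied from the log excerpt above.
sample = r"""
20141128 11:11:07.744 1568 CFAD  3371 [ERROR] Config::DsSession::Bind Failed to DsBind(). dc:\\strmdc11.m1.streaming.win.p.fti.net domainName:<null> Error:1722
20141128 11:11:07.744 1568 CFAD  3371 [ERROR] Config::DsSession::Bind Failed to DsBind(). dc:\\10.98.131.178 domainName:<null> Error:87
+    [Error:160(0xa0) Config::AdConfig::ConnectToLocalDc ad.cpp:8399 1568 W One or more arguments are not correct.]
"""

for code, count in sorted(tally_error_codes(sample).items()):
    print(code, count)
```

In the excerpt, 1722 is the failure when binding by DNS name ("The RPC server is unavailable"), while 87 and 160 follow from the retry by IP address, so the first one is probably the one worth chasing.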

---------------------------------------------------------------

>dfsrdiag dumpadcfg /verbose
[INFO] Computer Name: FILERST13
[INFO] Computer DNS: filerst13.m1.streaming.win.p.fti.net
[INFO] Domain Name: m1
[INFO] Domain DNS: m1.streaming.win.p.fti.net
[INFO] Site Name: Montsouris
[ERROR] Failed to connect to AD for: m1.streaming.win.p.fti.net Err: 1355 (0x54b)

[INFO] Execution Time: 0 seconds
Operation Failed

>dfsrdiag pollad /verbose
[INFO] Computer Name: FILERST13
[INFO] Computer DNS: filerst13.m1.streaming.win.p.fti.net
[INFO] Domain Name: m1
[INFO] Domain DNS: m1.streaming.win.p.fti.net
[INFO] Site Name: Montsouris
[INFO] Connected to WMI services on computer: filerst13.m1.streaming.win.p.fti.net
[INFO] Invoke PollDsNow() method on filerst13.m1.streaming.win.p.fti.net
[ERROR] PollDsNow method executed unsuccessfully. ReturnValue: 12 (0xc)
[ERROR] Failed to execute PollAD command Err: -2147217407 (0x80041001)

[INFO] Execution Time: 76 seconds
Operation Failed
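
The same error appears in decimal and in hex in the output above. A small Python sketch (the helper name `to_hresult_hex` is hypothetical) that converts the signed decimal value into the hex form you would normally search for:

```python
def to_hresult_hex(signed_value: int) -> str:
    """Render a signed 32-bit error value in the usual 0x........ form."""
    return f"0x{signed_value & 0xFFFFFFFF:08x}"


print(to_hresult_hex(-2147217407))  # prints 0x80041001
print(hex(1355))                    # prints 0x54b
```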

On the site 2 domain controllers, dcdiag /e doesn't reveal any issue.

---------------------------------------------------------------------

I tried installing the hotfix referenced in this thread (it didn't help) -> https://social.technet.microsoft.com/Forums/en-US/7d486eb5-6b03-471c-a4dc-65826e712fc3/dfsr-replication-event-id-1202-the-dfs-replication-service-failed-to-contact-domain-controller?forum=winserverfiles

I don't have any DNS issue (nslookup works fine).
The firewall is disabled on all servers.

My problem looks a bit like the one described here (but that article is old and doesn't mention 2008 R2) -> blogs.technet.com/b/askds/archive/2011/04/08/restrictions-for-unauthenticated-rpc-clients-the-group-policy-that-punches-your-domain-in-the-face.asp

Any help will be greatly appreciated.

Fabien




Commit changes on DFS, Windows Server 2012 R2


Hi All

I have 2 servers running Windows Server 2012 R2, and I am installing DFS on them.

The issue occurs at the step where I add the target folder: it gives an error. What can I do to solve this?


Best Regards, Stanley

Implementing Tiered Storage on Existing Storage Spaces Pool - Server 2012 R2


All,

I have an existing Storage Spaces pool on a Windows Server 2012 R2 machine.  All drives are HDDs, and the pool is divided into three volumes (1 mirror, 2 parity).  I am considering adding some SSDs for the performance increase by implementing tiered storage. However, while I have found quite a few guides detailing how to set up tiered Storage Spaces from scratch, I haven't been able to find one that describes how to add tiering to an existing pool.

My questions:

1.  Can you add an SSD tier to an existing Storage Spaces Pool?  If so, how?

2.  Is it destructive to existing data?  I will abandon the plan if it isn't transparent to existing data.

Thanks in advance!


Offline Files (Background) Sync makes File Explorer and other programs respond slowly


Hi TechNet,

I have a problem with Offline Files and especially with VPN users.
For VPN users, the slow-link policy is activated with a background sync interval of one hour. But when Offline Files is syncing, their File Explorer hangs (not responding) when creating, deleting, or renaming files/folders. This happens only in their (redirected) UserData folder or in other DFS shares which are cached locally. In other programs (Outlook or Office 2010), exactly the same thing happens (not responding) when they save data in the UserData folder. This problem occurs both on the internal network and via VPN, so bandwidth is not the problem. When the sync is finished, everything works fine again.

I've tested:

- Offline mode with scheduled background sync: problem occurs
- Online mode with manual sync: problem occurs
- Rebuild offline cache database: no effect
- Tested with fresh Windows 8(.1) installations and user profiles: problem occurs
- Monitoring SMB traffic with WireShark: no result
- Monitoring Windows 8(.1) clients with procmon: no result
- Tested with full DFS paths: problem occurs
- Disable IPv6: problem occurs
- Disable Windows Search: problem occurs

- Browsing in the Offline Files folder (Sync Center -> Manage Offline Files -> View Offline Files) is very slow (with +- 200 files). This occurs also when Offline Files is not syncing.

Does anyone have an idea how I can solve this problem?

Thanks,
Bas

(Sorry for my bad English)


Cannot optimize volume - Receive error "The specified extrinsic Method does not exist"


When I try to optimize a volume on a server running Windows Server 2012 (R1), I receive the error below. This volume is an iSCSI target on our Compellent SAN. Other servers don't have an issue with volumes created on the same SAN, and I have already run ChkDsk, which returned no errors. Oddly enough, I can run "Defrag T: /D", but "Defrag T: /O" returns "Incorrect Function. (0x80070001)". (Incidentally, "Optimize-Volume T -Defrag" also works.)

Any Ideas? I'm stumped!

Command:

Optimize-Volume -DriveLetter T -Verbose

Error:

VERBOSE: Invoking slab consolidation on iSCSI_SAN01_ExchLogFiles (T:)...
VERBOSE: Slab Analysis:  100% complete.
optimize-volume : The specified extrinsic Method does not exist.
At line:1 char:1
+ optimize-volume -DriveLetter T -Verbose
+ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : MetadataError: (MSFT_Volume (Ob...2-a802-ba7e...):ROOT/Microsoft/...age/MSFT_Volume) [Optimize-Volume], CimException
    + FullyQualifiedErrorId : MI RESULT 17,Optimize-Volume

Used space on a deduplicated volume is not reduced even after deleting all files on it

When I enable data deduplication on Server 2012 R2, I can reduce the space used on the volume. For example, before deduplication I have 5 GB of data on the disk; after deduplication it is reduced to 2.5 GB. Then, as a test, I deleted all of the files and folders on that volume, so I expected the used space to drop to 0 GB. But it still shows 2.5 GB in use. Why didn't the used space shrink? Could anyone help me?

Free space on 250 GB drive won't let me create a new volume

Hi, I have a 250 GB Samsung external hard drive, but for some reason it only showed me 58.8 GB available. When I opened Disk Management, it said there is 116.14 free space, so I tried making a new volume from that, but it won't let me; it keeps saying there is not enough space on disk to complete this operation. I really need this space since I'm formatting my computer and I need that hard drive to hold some of the stuff that is on the computer right now. I'd appreciate any help. Thank you.

Recover deleted files from mirrored dynamic disks


Hi,

I need to recover files and folders that were deleted by robocopy. I was running a backup of existing photos from one server to another server (Windows Server 2008) using the robocopy /mir command. The destination folder had an existing folder in it that contained other photos, but because this folder was not in the source folder, it was deleted by robocopy. The destination folder is on a software-mirrored dynamic disk. I need to recover this destination folder, but have not had any luck. I have avoided writing any more data to the destination drive to prevent data loss. I am also not able to use the Previous Versions option, as this is not enabled.
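
For anyone reading along, the mirror behaviour described above (anything in the destination that is not in the source gets removed) can be previewed before a copy. Robocopy's own /L switch lists what would happen without changing anything; the Python sketch below shows the same idea independently of robocopy. The function name `extra_in_destination` and the folder names in the demo are hypothetical.

```python
import os
import tempfile
from typing import List


def extra_in_destination(source: str, destination: str) -> List[str]:
    """List relative paths that exist under destination but not under source.

    These are the entries a mirror-style copy would remove from the destination.
    """
    def relative_entries(root: str) -> set:
        entries = set()
        for current, dirs, files in os.walk(root):
            for name in dirs + files:
                full = os.path.join(current, name)
                entries.add(os.path.relpath(full, root))
        return entries

    return sorted(relative_entries(destination) - relative_entries(source))


if __name__ == "__main__":
    # Demo with throwaway folders: the destination holds one extra folder.
    with tempfile.TemporaryDirectory() as src, tempfile.TemporaryDirectory() as dst:
        os.makedirs(os.path.join(dst, "other_photos"))
        open(os.path.join(dst, "other_photos", "a.jpg"), "w").close()
        for path in extra_in_destination(src, dst):
            print("would be deleted:", path)
```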

So far I have tried using Recuva, PhotoRec and even a paid utility, Active@ Uneraser, but they cannot perform a straightforward undelete. I have to use the deep scan method, which doesn't recover the original file and path names. Even then, I don't think I am getting back the files I need (there are a lot of files to sort through), as the creation dates don't match up with the ones I am after.

My real question is, does a dynamic mirrored volume allow undelete? So far it doesn't seem so. Is it possible that these files are completely unrecoverable? Any help would be appreciated.

Error removing physical disk marked as Retired

I have Server 2012 with one storage pool and one virtual disk. The virtual disk uses the parity layout and thin provisioning. It contained 4 physical disks. One of the physical disks failed. It was pulled and a larger replacement disk was added.

The server manager now lists the failed drive as "Retired". Every attempt to remove the disk results in:

Error removing physical disk: There was an error removing {179f49b7-7657-11e2-93ea-806e6f6e6963} (fileserver). One of the physical disks specified could not be removed because it is still in use.

If I check the properties of the virtual disk, it states under health: "Physical disks in use", and lists the retired drive as "Lost Communication".

The physical drives have lots of free space, and the new drive has been added to the storage pool (but not the virtual disk). The "repair virtual disk" option is grayed out.

It seems I cannot attach the virtual disk until I remove the retired drive.

How can a disk that's sitting unplugged in another room be "in use"? How do I remove the retired drive?


Unable to access DFS Folder


Hey Technet,

I have a strange problem accessing DFS folders. I can access folders hosted on other nodes, but when I try to access DFS folders hosted on the same node, I get the error below:

But when I try to access resources in a different folder, I can access them:

C:\Users\Administrator>dir \\room.com\Root\Movies2
 Volume in drive \\room.com\Root has no label.
 Volume Serial Number is B45C-9EA6

 Directory of \\room.com\Root\Movies2

07/25/2014  07:05 PM    <DIR>          .
07/25/2014  07:05 PM    <DIR>          ..
06/10/2014  11:12 AM    <DIR>          13 Sins (2014)
01/23/2014  10:18 PM    <DIR>          22 Female Kottayam
05/21/2014  10:58 PM       734,958,866 2States (2014) [Hindi DVDScr - XviD - 1CDRip - 700MB].avi

Kindly address this issue.


Regards, Alan.

Windows Basic Disk with MBR Aligned to LBA 0


Guys, I have noticed that on some of my Windows VMs, there are Windows basic disks with MBR whose primary partitions are aligned on LBA 0:

BlockSize  DeviceID               PrimaryPartition  StartingOffset  Type
512        Disk #0, Partition #0  TRUE              1048576         Installable File System
512        Disk #1, Partition #0  TRUE              1048576         Installable File System
512        Disk #2, Partition #0  TRUE              1048576         Installable File System
512        Disk #3, Partition #0  TRUE              0               Installable File System
512        Disk #4, Partition #0  TRUE              0               Installable File System
512        Disk #5, Partition #0  TRUE              0               Installable File System
512        Disk #6, Partition #0  TRUE              0               Installable File System
512        Disk #7, Partition #0  TRUE              1048576         Installable File System
512        Disk #8, Partition #0  TRUE              0               Installable File System
512        Disk #9, Partition #0  TRUE              1048576         Installable File System

How is this possible? Shouldn't the MBR be at LBA 0?
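
For reference, the StartingOffset column is in bytes, so the starting LBA is the offset divided by the block size. A quick Python sketch using the values from the table above (the function name `starting_lba` is just for illustration):

```python
BLOCK_SIZE = 512  # bytes per block, from the BlockSize column

# StartingOffset values reported for Disk #0 .. Disk #9 in the table above.
starting_offsets = [1048576, 1048576, 1048576, 0, 0, 0, 0, 1048576, 0, 1048576]


def starting_lba(offset_bytes: int, block_size: int = BLOCK_SIZE) -> int:
    """Convert a byte offset into a logical block address."""
    return offset_bytes // block_size


for disk, offset in enumerate(starting_offsets):
    print(f"Disk #{disk}: offset {offset} bytes -> LBA {starting_lba(offset)}")
```

The disks with an offset of 1048576 start at LBA 2048; the ones reporting 0 would start at LBA 0, which is the sector where the MBR itself normally lives.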

Windows 2008 Alignment of Drives less than 4GB

Guys, why does Windows 2008/2008R2 align drives that are smaller than 4 GB to 65536 and all drives larger than 4 GB to the 1 MB boundary (1048576)? It's not uncommon on virtual machines to present small drives for holding logs etc. It's also common when using SANs. I'm wondering what the decision process behind this was. I understand, of course, the reason to align to a 4K boundary due to the sector size of modern drives.
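
As a sanity check on the numbers in the question, both offsets are multiples of 4 KB, so either one satisfies the 4K alignment requirement; only the larger one also sits on a 1 MB boundary. A tiny Python sketch (the function name `is_aligned` is just for illustration):

```python
def is_aligned(offset_bytes: int, boundary_bytes: int) -> bool:
    """True if the offset is an exact multiple of the boundary."""
    return offset_bytes % boundary_bytes == 0


for offset in (65536, 1048576):
    print(offset, "4K:", is_aligned(offset, 4096), "1MB:", is_aligned(offset, 1048576))
```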

DFS-R Restore Access To Path Denied Error


I am trying to restore the contents of the ConflictsAndDeleted folder to a desired location with the Restore-DfsrPreservedFiles cmdlet, but the command breaks with an "Access to Path denied" error after restoring only a 7-level folder tree, up to a file within the seventh folder.

The path to the file is just 201 characters long, so it is not a path-too-long issue.

Could someone help here, please?


BPK

FSRM 2008R2 reports a very different size from folder properties


Hi Everyone,

I'm having this issue and have spent a huge amount of time searching around, but I cannot explain why FSRM reports a big difference in used space compared to Windows folder properties.

As you can see in the following, Windows reports 513 GB but FSRM says 662 GB.

Based on my research, the difference is due to the Windows block size and small files. Yes, we have around 150,000 small files of 2 KB, 10 KB, 20 KB, etc. Our default block size on disk is 4 KB.

Let's say each small file wastes 2 KB on average; the wasted space on disk is then only 150,000 x 2 KB = 300 MB.

Where does the approx. 150 GB difference come from?
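
The arithmetic above can be checked with a short Python sketch. It only models cluster rounding with the 4 KB block size stated in the post; the function name `slack_bytes` is hypothetical.

```python
CLUSTER = 4 * 1024  # 4 KB allocation unit, as stated in the post


def slack_bytes(file_size: int, cluster: int = CLUSTER) -> int:
    """Bytes allocated on disk beyond the file's logical size."""
    clusters_needed = -(-file_size // cluster)  # ceiling division
    return clusters_needed * cluster - file_size


for size_kb in (2, 10, 20):
    print(f"{size_kb} KB file wastes {slack_bytes(size_kb * 1024)} bytes")

# Estimate from the post: 150,000 files wasting 2 KB each.
estimated_waste = 150_000 * 2 * 1024
print(f"estimated slack: {estimated_waste / 1024**2:.0f} MiB")  # prints estimated slack: 293 MiB

# Gap that actually needs explaining.
print(f"reported gap: {662 - 513} GB")  # prints reported gap: 149 GB
```

So cluster slack accounts for roughly 0.3 GB of a 149 GB gap, which supports the point that block size alone cannot explain it.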

How can we explain this? Can someone please help? Thanks

ps: I'm using Windows 2008R2, 4KB block as mentioned. No compression.




Unable to use a virtual disk on CiB as a Witness disk without making it a cluster resource on Windows 2012 R2 Enterprise Edition


Hello Everyone,

I hope someone can help us with this one. We have a customer with a 2-node cluster in a SuperMicro CiB. The issue we are facing is that with Windows Enterprise edition we are unable to use a virtual disk on a CiB as a Witness disk without making it a cluster resource on Windows 2012 R2. We are able to successfully create a virtual disk, but it won't let me use it until I add it as a CSV. The issue is that if I add it as a CSV, then I cannot use it as a Witness.

But, I can do the same when I am using the Windows 2012 R2 Storage Server. I can create a virtual disk and it shows up as available, which I can use as a witness disk. Is there a difference in the way Enterprise Edition works in this context?

I will try to post the screenshots, but basically the virtual disk shows up as offline under Failover Cluster Manager > Disks. It won't come online until I add it to the CSV. Any help in understanding this will be greatly appreciated.

Change: Sorry guys, not Enterprise edition, but Datacenter edition.


DFS Replication Setup Error on different domains : The server's operating system version cannot be retrieved. The network path cannot be found


We have old DFS servers (W2008) on one domain, and I have recently set up new W2012 R2 servers that will act as new DFS servers on a different domain. To lay it out, consider the servers below; note that they belong to the same forest:

DFS01.ASIA.WORLD  --> W2008 server on ASIA.WORLD domain

DFS02.ASIA.WORLD --> W2012 Standard server on ASIA.WORLD domain

DFS03.EUROPE.WORLD -->W2012 R2 on Europe domain

DFS04.EUROPE.WORLD -->W2012 R2 on Europe domain

If I configure a replication group between DFS01 and DFS02 (both servers on the same domain), the setup works fine. If I try to configure a replication group between DFS03 and DFS04 (both servers on the same domain), it works fine as well.

But when I try to set up DFS replication using the New Replication Group Wizard with DFS01 (as source) replicating to DFS04 (as destination), it throws the error "The server's operating system version cannot be retrieved. The network path was not found". If I do it the other way around, reversing the source and destination, it gives me the same error.

The first thing that came to mind was obviously the trust relationship between the domains, but both domains have been around for quite some time, there are users, workstations and servers in both domains, and we all use and log on to them every day.

The Domain Admins groups of both ASIA and EUROPE have already been added as Local Administrators on both servers, and of course I'm logged on to both servers using an account that is a member of the Domain Admins group.

I tried searching Google, and it only gives me one item about this, which does not contain helpful information.

Can you please direct me to items that I can check? Thank you.

Windows server 2012 R2 hangs on boot when connecting two SAS cables


Hi,

I currently have a problem with my SOFS servers. I'm using 3 Dell PowerEdge R720s with Dell 6Gbps SAS HBA cards inside to connect 3 DataOn 1640 JBODs. I have all the latest (1 December 2014) Windows updates, drivers and firmware in place.

When I connect both SAS cables to the HBA card, Windows hangs at the boot screen (I already waited 3 days to see if it just needed some time). As soon as I disconnect one of the SAS cables, booting continues.

It doesn't matter which cable is connected: when they're connected separately the server boots, and when they're connected at the same time the server hangs at boot time. This happens on all 3 servers. Is there any way to debug the boot process? ntbtlog.txt doesn't really tell me anything useful...

Thanks in advance! 

Cannot Access DFS Replication performance counter


Hi 

I am setting up DFS Replication on Windows Server 2012. I have added the accounts to the Performance Log Users group as well, but I'm still getting this error in the health report.



When I check the performance counter locally on the server by running this command, I can see that performance data is available.

wmic path Win32_PerfFormattedData_Dfsr_DFSReplicatedFolders

Can anyone guide me on this, please?

Thanks

Mumtaz 


Server 2012 R2 File Server Stops Responding to SMB Connections


Hi There,

Massive shot in the dark here but I am struggling with a pretty major issue atm.  We have a production file server that is hosted on the following:

Dell MD 3220i -> iSCSI -> Server 2008 R2 Hyper-v Cluster -> Passthrough Disk -> Server 2012 R2 File Server VM

Essentially, 3 times now, roughly a month or so apart, the file server has stopped accepting connections.  During this time, the server is perfectly accessible through RDP or with a simple ping.  I can browse the files on the server directly, but no-one appears to be able to access the shares over SMB.  A reboot of the server fixes the issue.

As per a KB article, I removed NOD antivirus from the server after the second fault to rule out a conflicting filter mode driver.  Sadly, yesterday it happened again.

The only relevant errors in the server's log files are:

SMB Server Event ID 551

SMB Session Authentication Failure

Client Name: \\192.168.105.79
Client Address: 192.168.105.79:50774
User Name: HHS\H6-08$
Session ID: 0xFFFFFFFFFFFFFFFF
Status: Insufficient server resources exist to complete the request. (0xC0000205)

Guidance:

You should expect this error when attempting to connect to shares using incorrect credentials. This error does not always indicate a problem with authorization, but mainly authentication. It is more common with non-Windows clients. This error can occur when using incorrect usernames and passwords with NTLM, mismatched LmCompatibility settings between client and server, duplicate Kerberos service principal names, incorrect Kerberos ticket-granting service tickets, or Guest accounts without Guest access enabled

and

SMB Server event ID 1020
File system operation has taken longer than expected.

Client Name: \\192.168.105.97
Client Address: 192.168.105.97:49571
User Name: HHS\12J.Champion
Session ID: 0x2C07B40004A5
Share Name: \\*\Subjects
File Name:
Command: 5
Duration (in milliseconds): 176784
Warning Threshold (in milliseconds): 120000

Guidance:

The underlying file system has taken too long to respond to an operation. This typically indicates a problem with the storage and not SMB.
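
To put the event 1020 numbers in perspective, the "Key: Value" lines of the event text can be parsed and compared. A small Python sketch, with the hypothetical helper `parse_event_fields` and three lines copied from the event above:

```python
def parse_event_fields(text: str) -> dict:
    """Split 'Key: Value' lines of an event message into a dictionary."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


event = """\
Command: 5
Duration (in milliseconds): 176784
Warning Threshold (in milliseconds): 120000
"""

fields = parse_event_fields(event)
duration = int(fields["Duration (in milliseconds)"])
threshold = int(fields["Warning Threshold (in milliseconds)"])
print(f"{duration / 1000:.1f} s, {(duration - threshold) / 1000:.1f} s over the threshold")
# prints 176.8 s, 56.8 s over the threshold
```

A single file system operation taking almost three minutes is consistent with the event's own guidance that the storage path, rather than SMB, is where to look first.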

I have checked the underlying disk, iSCSI, network and Hyper-V cluster for any other errors or issues, but as far as I can tell everything is fine.

Is it possible that something else is left over from the NOD antivirus installation?  

Looking for suggestions on how to troubleshoot this further.

Thanks

