I put this together in a hurry and may have forgotten things, but this is my general assessment.
-
DC6 - As the dcdiag section below shows, it looks like DC6 has the problematic SYSVOL, which is why the NETLOGON share is not available (it's a subfolder called Scripts under Sysyol\tommy.local\sysvol\scrips)
Starting test: NetLogons
* Network Logons Privileges Check
Unable to connect to the NETLOGON share! (\\DC6\netlogon)
[DC6] An net use or LsaPolicy operation failed with error 67,
The network name cannot be found..
-
DNS settings:
This is based on best practices...
As for DC6's DNS settings, on DC6, go into the NIC, IPv6 properties, and set DNS to obtain automatically so the ::1 doesn't show up as a DNS address. Then remove its own IP, (172.17.0.149), since that's redundant with the loopback.
On delldc's DNS settings, change it so 172.17.0.149 is the first, and the loopback is the second.
-
Repl Latency
There are a bunch of latency retired vector invocations showing, which means there have been replication problems between them. One of the main causes of replication issues is antivirus. Uninstall any antivirus on the DCs during the troubleshooting process.
Check with the vendor to find out how to configure exculsion on a DC.
-
DellDC -
Dcdiag DFS errors: Did you try the recommendation in the dcdiag to reinitialize it?
"To resume replication of this folder, use the DFS Management snap-in to remove this server from the replication group, and then add it back to the group. This causes the server to perform an initial synchronization task, which replaces the stale data with
fresh data from other members of the replication group. "
But if the other DC doesn't have a set to replicate, then that won't work. Was a DFS migration attempted? If so, was it ever successful?
How to attack this depends on the EventID that is associated with it.
See if this discussion helps:
DFS Replication Service stopped on one folder, with Error 9098 (associated Event ID 4004, but there may be others associated with it)
http://social.technet.microsoft.com/Forums/windowsserver/en-US/f8f62854-84b3-4998-9aae-04830fb126fe/dfs-replication-service-stopped-on-one-folder-with-error-9098?forum=winserverfiles
See this article for a hotfix:
The DFS Replication service may stop responding when it initializes the replication process for the replicated folders on a computer that is running Windows Server 2003 R2, Windows Server 2008, or Windows Server 2008 R2
http://support.microsoft.com/kb/977381
-
Time service and Virtualization
Another thing that could have caused the whole thing is the time service. On HyperV, VMware or Xen,, you must disable time sync, otherwise, if time is synced from the host and throws off the time more than 5 minutes, then that can cause authentication problems.
Disable time sync on all hosts.
-
Which DC is the PDC? That must be configured with an external time source. Reconfigure it as such:
============
If you've experimented changing time settings to unknowlingly avert default behavior, you can set the time settings back to
default:
1. On the DC that you're experiencing issues with, run the following in a command prompt:
net stop w32time
w32tm /unregister
w32tm /register
net start w32time
2. On the Server in question, run the following in a command prompt:
"net time /setsntp: " (without the quotes, but I put that in there to signify the blank space prior to the closing quote)
[This tells the client (whether a DC or workstation) to delete the current registry settings for time and use
default settings.]
Restart the time service:
Net stop w32time && net start w32time
3. On the PDC Emulator run the following in a command prompt:
W32tm /config /manualpeerlist:time.windows.com /syncfromflags:manual /reliable:yes /update
W32tm /resync /rediscover
Restart the time service:
Net stop w32time && net start w32time
4. On each DC that are not holding the PDC Emulator role, run the following in a command prompt:
w32tm /config /syncfromflags:domhier /update
W32tm /resync /rediscover
Restart the time service:
Net stop w32time && net start w32time
5. check the new configuration:
W32tm /query /configuration
w32tm /query /source
W32tm /monitor
6. This will also take out any errors in the Event Viewer, if there were any.
============
-
PortQRY
Looks like UDP 138 is blocked. AV causing it?? Not sure.
-
Event log errors? Check both for any AD related errors and post them, please.
-
Summary
Looks like generally speaking, replicaiton is working except for DFS and SYSVOL, which looks to melike a failed DFS migration. This will cause logon issues because GPOs are not accessible from DC6.
Note: With all due respect, due to the complexity of what's going on, something like this may take some time to fix, and it may be prudent to give Microsoft PSS a call to fix this. It's only about $275 for the ticket, and they will take all the time required
to fix it.
-