This system and its software wasn't without its minor quirks, but it did work pretty well.
Until recently.
A couple of months ago he started getting anomalous readings from the unit - and a day or two later, it failed to provide any current readings at all but it still read the mains voltage. Upon opening his breaker panel he could detect the strong smell of burnt glass-epoxy circuit board so he knew that the unit had catastrophically failed in some way.
Thus, it landed on my workbench.
As it was, I could hear parts rattling about - almost never a good sign - and after using the "spudging" tool to get it apart I could see the problem: Two arrays of incinerated 39 ohm surface mount resistors.
Lightning damage? I think not!
Based on a cursory overview, this unit appears to directly rectify the 240 volt mains and apply it to a switch-mode converter - and this portion of the circuity appeared to be undamaged - a fact borne out by the owner who said that it was still reporting mains voltage when he pulled it from service. What appeared to be "smoked" were the shunt resistors for both sets of CTs (current transducers) - and the question came up: "How the hell did that happen?"
Lightning damage or a power line transient seems unlikely considering the very nature of how CTs are connected and used:
- First off, CTs are completely isolated from the current-carrying conductor that they are measuring, so some sort of "arc 'n spark" of mains voltage to the sensor input would seem to be out of the question. I would expect that the stand-off voltage of the CT on the piece of wire that was being monitored would be in the high kilovolt-range - and if there had been enough voltage to break down the insulation not only would there be visible evidence.
- This damage appears to be longer-term than a brief transient, having occurred over a long enough time to thoroughly char the board as seen in the pictures. A very brief, high-energy transient would likely have blown components clear off the board and, at the very least, physically damaged other components in the signal path.
- He has a "whole house" surge suppressor installed - a "good" one: Certainly that would have suppressed a transient capable of causing direct damage via the CT input - assuming that it was likely at all. Had a massive transient actually happened, one would expect that the suppressor would have shown signs of "distress".
- An event capable of this sort of damage - again assuming a transient - would have surely caused other damage to something in the house: This was not the case.
- He has solar inverters at his house. At the time of damage, these would have surely registered a transient event, had their been one.
- Considering the time of year - and the weather involved - the probability of lightning falls into the "bloody unlikely" category - particularly since the weather was fine in the day or two that it took for it to go from "sort of working" to "failed" status.
Figure 4: The most badly damaged of the set of sense resistors. (Yes, pun intended!) Click on the image for a larger version. |
So, what happened?
At this point, it's really not possible to be completely sure, but it looks as though there may have been either a fault in both CTs (but how likely is that?) and/or there was a deficiency in the design of the monitoring board.
What are CTs?
CTs (current transducers) are nothing more than simple transformers: One passes the wire to be monitored through the middle of a toroidal core and a voltage is induced on the many windings of the secondary wound around it: The current through the wire in the middle is directly proportional to the (lower) current that flows and the way this is typically done is to terminate the secondary winding with a resistance. By using Ohm's law and measuring the voltage across that resistance, the current on the wire can be calculated.
It is absolutely imperative that a CT be terminated with a low-ish resistance as leaving it open-source can develop a tremendous voltage. But, there is a potential problem: Current transducers are very nearly an ideal current source - that is, whether you simply short its output together or run it through even a fairly high-value resistor, the current will (ideally) be the same - but knowing Ohm's law, the higher the resistance, the more voltage drop for a given current - and the more power being dissipated in the shunt resistor(s). Clearly, if the shunt resistance had increased, something terrible would be bound to happen.
What I expect happened was this:
- The shunt resistance - which appears to consist of ten 39 ohm resistors in parallel (for 3.9 ohms)may have been of marginal total dissipation rating. Under a moderate load, it's possible that these resistors have been running quite warm and over time, they have degraded, slowly increasing in value.
- As the value increased, the calibration would have started to drift: Whether or not that happened here over a long period is unknown - but the owner did report that it took a couple of days for the unit to go from sending alarms about nonsensical readings to the total loss of current readings.
- As the resistance went up, so would the power dissipation of the sense resistors. Because CTs are essentially constant current devices, as the voltage increased, the power being dissipated by those resistors would also increase. The the original failure mode was that the resistance was increasing due to these resistors running hot, the increased heat would have likely caused the previously slow-moving failure to accellerate.
- At some point, a cascade failure would have occurred, with the voltage skyrocketing - and the current remaining constant: This would certainly explain the evidence on the board.
What was the problem, then?
Assuming that there was not any sort of inadequacy in the original circuit design, I'm at a loss to explain the damage to the board.
What seems to have been the issue was, in fact, stressed components on the circuit board and/or a failure of the CT itself (or even the wrong CTs being supplied) but it seems unlikely that both CTs would have failed in exactly the same way.
Barring other information, I'm tending toward believing that a gradual degradation of the shunt resistors - possibly owing to the original components being thermally stressed under normal conditions - was a problem, culminated with a cascade failure at the end.
It would be very interesting to have a peek inside other units of the same model and revision that have been installed for a while to see if they show thermal stress related to the shunt resistors. A quick perusal on the GoogleWeb did not immediately reveal this to be a common problem, so it is possible that this is some sort of freak incident.
Unless he decides to get another unit to replace this one and a comparison is done, we'll probably never know.
This page stolen from ka7oei.blogspot.com
[End]