I downloaded your HTML extract and found one problem: it did not follow many of the links to subsidiary pages such as "Title Note" and "Article Note". For an example, see 15-10-26. which has the following Title Note:
CROSS REFERENCES. --Criminal Justice Coordinating Council, 35-6A-1 et seq. Establishment of county law libraries, 36-15-1 et seq. Court-martial jurisdiction, 38-2-370 et seq. Designation of courts which possess jurisdiction over traffic offenses, and procedure in such courts, 40-13-1 et seq. Indictment and punishment of judge of probate court for malpractice, partiality, conduct unbecoming office, and other offenses, 45-11-4.
LAW REVIEWS. --For article, "The Majority That Wasn't: Stare Decisis, Majority Rule, and the Mischief of Quorum Requirements," see 58 Emory L. J. 831 (2009).
RESEARCH REFERENCES
Am. Jur. Trials. --Judicial Technology in the Courts, 44 Am. Jur. Trials 1.
According to the US Supreme Court, these notes are part of the official code and thus not protected by copyright. Citizens are held accountable to the interpretations given in these notes, and Georgia has made them part of the "official" code, and thus they must be available to all citizens.
Can you update your code to extract these notes as well? Thanks!