|
Sawmill Discussion Forum
ashp

unregistered user
|
Jun-15-05, 06:58 AM (PDT) |
|
|
"Ongoing nightmare with rebuild/update of a profile."
| |
Hi there, This profile takes roughly 2 full days to build, so I don't want to rebuild it from scratch again. I did that, then immediately after did an update, and I get: #### Can't open file LogAnalysisInfo/Databases/profile/main/main_table/xrefs/21/segment0.dat (mode=wb) (No such file or directory)<br /><pre> Is there a way to regenerate just the xrefs? It takes a full day or so to uncompress all the logfiles and read them in. |
|
|
|
Printer-friendly page | Top |
|
|
dgilmore
Member since Nov-18-04
3846 posts |
Jun-15-05, 12:05 PM (PDT) |
 |
1. "RE: Ongoing nightmare with rebuild/update of a profile."
In response to message #0
 |
What you'd need to do is run from the command line as follows: SawmillCL.exe -p yourprofilename -a rcrt Run the command above from your Sawmill install directory, if you are using linux/unix/Mac OS the command format would be: ./sawmill -p yourprofilename -a rcrt to get a list of profiles from the command line run: ./sawmill -a lp or on Windows, SawmillCL.exe -a lp The "rcrt" action will build only cross reference tables from the command line without requiring re-processing of the log data. see this link for more info- http://sawmill.net/cgi-bin/sawmill7/docs/sawmill.cgi?dp+docs.option+webvars.option_name+command_line.action+webvars.username+samples+webvars.password+sawmill As a side note, depending upon the method you've used to compress the log data and the type of compression, Sawmill natively reads compressed log data directly. So for example if you've got log data compressed in a ".zip" archive you do not need to uncompress this, and then process the log data. You should be able to simply point the log source to the compressed archive and Sawmill will process it natively. Hopefully this will save you time in the future. However, we do not support all compressed file formats natively, see this link for info on what we support: http://sawmill.net/cgi-bin/sawmill7/docs/sawmill.cgi?dp+docs.faq.entry+webvars.entry+gzippeddata+webvars.username+samples+webvars.password+sawmill A few other questions... Do you see any errors in the TaskLog file as to problems with the build, or some error message related to a build failure? How much data are you processing using the internal Database? What platform are you building on? Did you run the build from the GUI, or from the command line? David Sawmill Product Support Team support@flowerfire.com |
|
|
|
Printer-friendly page | Top |
|
|
 |
ashp

unregistered user
|
Jun-16-05, 04:53 AM (PDT) |
|
|
2. "RE: Ongoing nightmare with rebuild/update of a profile."
In response to message #1
| |
Hi there, Thanks for the rcrt switch, that was greatly appreciated! I'm building on a Linux box, the logfiles are all gzips and left compressed, it's just each one expands to about a 250M file and they date back to 2004, so it takes a while to crunch through, I imagine I'm just hitting I/O and CPU limits. I didn't get any errors during the build, it seemed to build OK. Awkwardly, I kicked it off from the web interface and someone then unplugged the network cable on me, so I ended up just following the build by viewing the IPC/ file for the task directly. It's crunching through the xref stuff now, so hopefully that should sort out the first problem, then I can attempt another update. Previously it just stopped updating, cutting off at May 17th, despite all the logfiles being present and read by sawmill during an update. The session overview thing had stopped working as well, so I imagine something was corrupted in some way. Doing updates from the command line showed no errors, so I was out of ideas apart from a full rebuild. I think sometimes what happens is that a build or update is already running, and someone triggers a second one, and rather than detecting that the database is being modified, it tries to start a second one anyway. I wish I could maybe get sawmill modified to create a lock file so that can't happen.  |
|
|
|
Printer-friendly page | Top |
|
|
|
 |
|
 |
hmbmartin
Member since Apr-2-13
2 posts |
Jun-16-05, 07:49 AM (PDT) |
|
3. "RE: Ongoing nightmare with rebuild/update of a profile."
In response to message #1
| |
Tried this for a problem I was having on an IA64 platform and got this error: Sawmill 7.1.9; Copyright (c) 2005 Flowerfire #### Internal Error: Unknown progress step 'building_xrefs_separately'<br /><pre> Error occurred at: : Sf164d383f8fd9c8770b7f012c7bedb96.cpp:246 </pre> Cheers,
Sam samuel.martin@hp.com |
|
|
|
Printer-friendly page | Top |
|
|
|
 |
|
 |
ashp

unregistered user
|
Jun-20-05, 06:04 AM (PDT) |
|
|
6. "RE: Ongoing nightmare with rebuild/update of a profile."
In response to message #4
| |
Still more problems. I dumped the database, built from scratch, then tried to get the sessions to work again. Now I get: Can't open file LogAnalysisInfo/ReportCache/jessops/SessionCache/c113cb3e049e13d2120c2b96ab6e854d/page.expanded (mode=wb) (No such file or directory) ; Error occurred at: : /tulip1/sawmill/v7/production/src/utilities.cpp:2464; This differs from yesterday, when it gave me an out of memory errors, despite the RSS being around 400M and the box having 3G. Anyone seen these before? |
|
|
|
Printer-friendly page | Top |
|
|
|
 |
|
 |
ferrar
Member since Sep-5-01
3689 posts |
Jun-22-05, 01:19 PM (PDT) |
 |
9. "RE: Ongoing nightmare with rebuild/update of a profile."
In response to message #8
 |
It sounds like you're encountering a type of problem that a few others customers have seen, involving high memory usage in session reporting. Sawmill performs the session analysis in memory, which means that for very large datasets, it may not be able to successfully perform the session analysis at all, if you don't have enough RAM. For extremely large datasets (which you clearly have), it can take several GB of RAM to do a full session analysis, and even 3GB may not be enough. As an initial step in testing if this is the case, please try zooming in on a single day, than *then* doing a session report. If that works, but an unfiltered report does not work then this is probably an out-of-memory condition. We recognize that it's somewhat outrageous that Sawmill requires gigabytes of RAM for this analysis, and we are working to move the session analysis to disk, to fix this problem. In fact, that latest internal code for Sawmill 7.2 includes this feature--session analyses are done primarily on disk, rather than in memory. This is highly experimental at this point, and I couldn't possible recommend it for a production environment, but that's the long-term solution: Sawmill 7.2 will fix this excessive memory usage issue (and a few other excessive memory usage issues as well). If you try with a one-day filter, does it work? |
|
|
|
Printer-friendly page | Top |
|
|
|
 |
ashp

unregistered user
|
Jun-23-05, 04:07 AM (PDT) |
|
|
10. "RE: Ongoing nightmare with rebuild/update of a profile."
In response to message #9
| |
I see, thanks for the explanation, I guess we'll have to wait on 7.2 and just restrict to ranges for the time being. I did however try with a single day and got: 3. Generating report (1) nan% 00:00:00 00:01:29 It stuck at nan% forever. I'll try it again with two days, see if that makes a difference. |
|
|
|
Printer-friendly page | Top |
|
|
|
 |
ferrar
Member since Sep-5-01
3689 posts |
Jul-15-05, 03:32 PM (PDT) |
 |
11. "RE: Ongoing nightmare with rebuild/update of a profile."
In response to message #10
 |
If you'd like to play with the 7.2 pre-release, it is available from http://sawmill.net/prerelease.html . This is a very early pre-release (not anywhere near as advanced as "beta"), with several known bugs and countless unknown ones. In particular, I am fairly certain there are errors in the session calculation, though it's "close" (i.e. comparing Session Overview from 7.1 to 7.2, I get different numbers, but usually off by just a few, so it's probably some fairly small bug with not removing page reloads, or missing timeout intervals, or something else). Still, if you'd like to live on the edge, there it is. |
|
|
|
Printer-friendly page | Top |
|
|
|
|
|