HagalazAI committed on
Commit 9283895 · verified · 1 Parent(s): 60cf555

Update README.md

Files changed (1)
  1. README.md +11 -1
README.md CHANGED
@@ -4,7 +4,7 @@ tags:
  - security
  - red-team
  - roberta
- license: apache-2.0
+ license: odc-by
  datasets:
  - trendmicro-ailab/Primus-FineWeb
  metrics:
@@ -78,3 +78,13 @@ print(f"P(offensive) = {prob:.3f}")
  is_red = prob >= 0.515 # ← recommended threshold
  print("is_red:", is_red)
 
+ ## Source & License
+
+ This dataset is built from the Primus-FineWeb collection (trendmicro-ailab/Primus-FineWeb), which is itself made available under the Open Data Commons Attribution License 1.0 (ODC-By-1.0). When you redistribute or build on this data you **must**:
+
+ 1. Include an attribution statement, e.g.:
+ > “Contains data from trendmicro-ailab/Primus-FineWeb, used under ODC-By-1.0 (http://opendatacommons.org/licenses/by/1-0/).”
+ 2. Keep any existing copyright or license notices intact.
+ 3. Abide by [Common Crawl’s Terms of Use](https://commoncrawl.org/terms-of-use/) for the underlying crawled content (e.g. don’t use it for illegal or harmful activities).
+
+ If you’re republishing any of the “hacking” or offensive-security tutorials, you’re perfectly free to do so—just follow those three steps above.
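
The attribution step in the added section could be automated when packaging a derivative work. The following is only a minimal sketch, not part of this commit: the helper names `build_attribution` and `append_attribution` are hypothetical, and the source name, license identifier, and URL are copied from the README text above.

```python
# Hypothetical helpers (not from the repository) that produce the
# attribution statement quoted in the README and append it to a
# derivative project's README text exactly once.

SOURCE = "trendmicro-ailab/Primus-FineWeb"
LICENSE_ID = "ODC-By-1.0"
LICENSE_URL = "http://opendatacommons.org/licenses/by/1-0/"


def build_attribution(
    source: str = SOURCE,
    license_id: str = LICENSE_ID,
    license_url: str = LICENSE_URL,
) -> str:
    """Return a one-line attribution statement for the given source."""
    return f"Contains data from {source}, used under {license_id} ({license_url})."


def append_attribution(readme_text: str) -> str:
    """Append the attribution notice unless it is already present."""
    notice = build_attribution()
    if notice in readme_text:
        # Existing notices are left untouched, so repeated runs are safe.
        return readme_text
    return readme_text.rstrip("\n") + "\n\n" + notice + "\n"


if __name__ == "__main__":
    print(append_attribution("# My derived dataset\n"))
```

This covers only the first requirement (including an attribution statement). Keeping existing notices intact and complying with Common Crawl's Terms of Use still need a manual check.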